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Foreword 


FOREWORD 

Permutations have a remarkably rich combinatorial structure. Part of the reason 
for this is that a permutation of a finite set can be represented in many equivalent 
ways, including as a word (sequence), a function, a collection of disjoint cycles, a 
matrix, etc. Each of these representations suggests a host of natural invariants 
(or “statistics”), operations, transformations, structures, etc., that can be applied 
to or placed on permutations. The fundamental statistics, operations, and 
structures on permutations include descent set (with numerous specializations), 
excedance set, cycle type, records, subsequences, composition (product), partial 
orders, simplicial complexes, probability distributions, etc. How is the newcomer 
to this subject able to make sense of and sort out these bewildering possibilities? 
Until now it was necessary to consult a myriad of sources, from textbooks to 
journal articles, in order to grasp the whole picture. Now, however, Miklos Bona 
has provided us with a comprehensive, engaging, and eminently readable 
introduction to all aspects of the combinatorics of permutations. The chapter on 
pattern avoidance is especially timely and gives the first systematic treatment of 
this fascinating and active area of research. 

This book can be utilized at a variety of levels, from random samplings of the 
treasures therein to a comprehensive attempt to master all the material and solve 
all the exercises. In whatever direction the reader’s tastes lead, a thorough 
enjoyment and appreciation of a beautiful area of combinatorics is certain to 
ensue. 


Richard Stanley 
Cambridge, Massachusetts 
January 14, 2004 
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Preface 


A few years ago, I was given the opportunity to teach a graduate Combinatorics 
class on a special topic of my choice. I wanted the class to focus on the 
Combinatorics of Permutations. However, I instantly realized that while there 
were several excellent books that discussed some aspects of the subject, there 
was no single book that would have contained all, or even most, areas that I 
wanted to cover. Many areas were not covered in any book, which was easy to 
understand as the subject is developing at a breathtaking pace, producing new 
results faster than textbooks are published. Classic results, while certainly 
explained in various textbooks of very high quality, seemed to be scattered in 
numerous sources. This was again no surprise; indeed, permutations are 
omnipresent in modern combinatorics, and there are quite a few ways to look at 
them. We can consider permutations as linear orders, we can consider them as 
elements of the symmetric group, we can model them by matrices, or by graphs. 
We can enumerate them according to countless interesting statistics, we can 
decompose them in many ways, and we can bijectively associate them to other 
structures. One common feature of these activities is that they all involve factual 
knowledge, new ideas, and serious fun. Another common feature is that they all 
evolve around permutations, and quite often, the remote-looking areas are 
connected by surprising results. Briefly, they do belong to one book, and I am 
very glad that now you are reading such a book. 

* * * 


As I have mentioned, there are several excellent books that discuss various aspects 
of permutations. Therefore, in this book, I cover these aspects less deeply than 
the areas that had previously not been contained in any book. Chapter 1 is 
about descents and runs of permutations. While Eulerian numbers have been 
given plenty of attention during the last 200 years, most of the research was 
devoted to analytic concepts. Nothing shows this better than the fact that I was 
unable to find published proofs of two fundamental results of the area using 
purely combinatorial methods. Therefore, in this Chapter, I concentrated on 
purely combinatorial tools dealing with these issues. By and large, the same is 
true for Chapter 2. Chapter 3 is devoted to permutations as products of cycles, 
which is probably the most-studied of all areas covered in this book. Therefore, 
there were many classic results we had to include there for the sake of 
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completeness, nevertheless we still managed to squeeze in less well-known topics, 
such as applications of Darroch’s theorem, or transpositions and trees. 

The area of pattern avoidance is a young one, and has not been given significant 
space in textbooks before. Therefore, we devoted two full chapters to it. Chapter 
4 walks the reader through the quest for the solution of the Stanley-Wilf conjecture, 
ending with the recent spectacular proof of Marcus and Tardos for this 23-year- 
old problem. Chapter 5 discusses aspects of pattern avoidance other than upper 
bounds or exact formulae. Chapter 6 looks at random permutations and Standard 
Young Tableaux, starting with two classic and difficult proofs of Greene, Nijenhaus 
and Wilf. Standard techniques for handling permutation statistics are presented. 
A relatively new concept, that of min-wise independent families of permutations, 
is discussed in the Exercises. Chapter 7, Algebraic Combinatorics of Permutations, 
is the one in which we had to be most selective. Each of the three sections of that 
chapter covers an area that is sufficiendy rich to be the subject of an entire book. 
Our goal with that chapter is simply to raise interest in these topics and prepare 
the reader for the more detailed literature that is available in those areas. Finally, 
Chapter 8 is about combinatorial sorting algorithms, many of which are quite 
recent. This is the first time many of these algorithms (or at least, most aspects of 
them) are discussed in a textbook, so we treated them in depth. 

Besides the Exercises, each Chapter ends with a selection of Problems Plus. 
These are typically more difficult than the exercises, and are meant to raise 
interest in some questions for further research, and to serve as reference material 
of what is known. Some of the Problems Plus are not classified as such because 
of their level of difficulty, but because they are less tighdy connected to the topic 
at hand. A solution manual for the even-numbered Exercises is available for 
instructors teaching a class using this book, and can be obtained from the 
publisher. 
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JVo Way Around It Introduction. 


This book is devoted to the study of permutations. While the overwhelming 
majority of readers already know what they are, we are going to define them 
for the sake of completeness. Note that this is by no means the only definition 
possible. 

DEFINITION 0.1 A linear ordering of the elements of the set [n]= {1, 2, 3, —, n } is 
called a permutation, or, if we want to stress t!w fact that it consists of n entries, an n- 
permutation. 

In other words, a permutation lists all elements of [n] so that each element is 
listed exactly once. 

Example 0.2 

If n= 3, then the w-permutations are 123, 132, 213, 231, 312, 321. [j 

There is nothing magic about the set [n], other sets having n elements would 
be just as good for our purposes, but working with [n] will simplify our 
discussion. In Chapter 2, we will extend the definition of permutations onto 
multisets, and in Chapter 3, we will work with an alternative concept of looking 
at permutations. 

For now, we will denote an n-permutations hy p=pip 2 --p„, with p, being the ith 
entry in the linear order given by p. 

The following simple statement is probably the best-known fact about 
permutations. 

PROPOSITION 0.3 

The number of n-permutations is n!. 

PROOF When building an n-pcrmulation pr=pip 2 ■ • we can choose n entries to 

play the role of pi, then n- 1 entries for the role of p 2 , and so on. | 

I promise the rest of the book will be less straightforward. 
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In One Line And Close. Permutations as 
Linear Orders. Runs. 


1.1 Descents 

The “most orderly” of all w-permutations is obviously the increasing 
permutation 123--W. All other permutations have at least some “disorder” in 
them, for instance, it happens that an entry is immediately followed by a smaller 
entry in them. This simple phenomenon is at the center of our attention in this 
Section. 

1.1.1 The definition of descents 

DEFINITION 1.1 Let p=pxp 2 ---p n be a permutation. We say that i is a descent of p if 
P>P i+ 1 ■ Similarly, we say that i is an ascent ofp fpffi+\- 

Example 1.2 

Let /;=3412576. Then 2 and 6 are descents of p, while 1, 3, 4 and 5 are ascents 
of P- □ 

Note that the descents denote the positions within^, and not the entries of p. The 
set of all descents of p is called the descent set of p and is denoted by D(p). The 
cardinality of D(p), that is, the number of descents of p, is denoted by d(p), though 
certain authors prefer des(p). 

This very natural notion of descents raises some obvious questions for the 
enumerative combinatorialist. How many w-permutations are there with a given 
number of descents? How many w-permutations are there with a given descent 
set? If two M-permutations have the same descent set, or same number of descents, 
what other properties do they share? 

The answers to these questions are not always easy, but are always interesting. 
We start with the problem of finding the number of permutations with a given 
descent set S. It turns out that it is even easier to find the number of permutations 
whose descent set is contained in S. 
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LEMMA 1.3 

Let S — {si, S2, • • • Sfc} C [n—lj, and let a (S) be the number of n-permutations whose 
descent set is contained in S. Then we have 

a{S)=( n \{ n - Sl )( n - S ^---(n- Sk ). 

\sij \s 2 - Si J \s 3 - s 2 j \n - s k J 

PROOF The crucial idea of the proof is the following. We arrange our n entries 
into k+\ segments so that the first i segments together have s, entries for each i. 
Then, within each segment, we put our entries in increasing order. Then the 
only places where the resulting permutation has a chance to have a descent is 
where two segments meet, that is, at j,, .s 2 , ■■■, s k . Therefore, the descent set of the 
resulting permutation is contained in S. 

How many ways are there to arrange our entries in these segments? The 
first segment has to have length j b and therefore can be chosen in (”) 
ways. The second segment has to be of length s 2 -s h and has to be disjoint 
from the first one. Therefore, it can be chosen in {fff !.) ways. In general, 
segment i must have length si-Sn if i<lk+ 1, and has to he chosen from the 
remaining n-s v \ entries, in ("Z.* 4 . -1 ) ways. There is only one choice for the 
last segment as all remaining n-s k entries have to go there. This completes 
the proof. | 

Now we are in a position to state and prove the formula for the number of n- 
permutations with a given descent set. 

THEOREM 1.4 

Let S C [n — 1 ]- Then the number of nfermutations with descent set S is 
fS)= £(—l) |s - T| a(S). 

T<S 

PROOF This is a direct conclusion of the Principle of Inclusion-Exclusion. 
(See any textbook on introductory combinatorics, such as [27], for this 
principle.) Note that permutations with a given A-element descent set H are 
counted ah = = (1 + (—times on the 

righthand side of (1.1). The value of a h is 0 except when \S-H\=(), that is, 
when S=H. So the right hand side counts precisely the permutations with 
descent set S. 


1.1.2 Eulerian numbers 

Let A(n, k) be the number of w-permutations with k-1 descents. You may be 
wondering what the reason for this shift in the parameter k is. If^ has k-1 descents, 
then p is the union of k increasing subsequences of consecutive entries. These 
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are called the ascending runs of p. (Some authors call them just “runs,” some others 
call something else “runs.” This is why we add the adjective “ascending” to 
avoid confusion.) Also note that in some papers, A(n, k) is used to denote the 
number of permutations with k descents. 

Example 1.5 

The three ascending runs of/>=2415367 are 24, 15, and 367. D 

Example 1.6 

There are four permutations of length three with one descent, namely 132, 
213, 231, and 312. Therefore, A( 3, 2)=4. Similarly, A( 3, 3)=1 corresponding to 
the permutation 321, and A( 3, 1)=1, corresponding to the permutation 123. 

□ 

Thus the permutations with k ascending runs are the same as permutations with 
k-\ descents, providing one answer for the notation A(n,k). We note that some 
authors use the notation ^ for A(n,k). 

The numbers A(n,k) are called the Eulerian numbers, and have several beautiful 
properties. Several authors provided extensive reviews of this field, including 
Carlitz [57], Foata and Schiitzenberger [89], Rnuth [136], and Charalambides 
[56]. In our treatment of the Eulerian numbers, we will make an effort to be as 
combinatorial as possible, and avoid the analytic methods that probably represent 
a majority of the available literature. We start by proving a simple recursive 
relation. 

THEOREM 1.7 

For all positive integers k and n satisfying k<n, we have 

A(n, k+\)=(k+l)A(n-\, k+\)+(n-k)A(n-\, k). 

PROOF There are two ways we can get an w-permutation p with k descents 
from an (n-1) -permutation p’ by inserting the entry n into p’. Either p’ has k 
descents, and the insertion of n does not form a new descent, or p 'has k-\ descents, 
and the insertion of n does form a new descent. 

In the first case, we have to put the entry n at the end of p’, or we have to insert 
n between two entries that form one of the k descents of p’. This means we have 
k+\ choices for the position of n. As we have A(n-1, k +1) choices for^] the first 
term of the right-hand side is explained. 

In the second case, we have to put the entry n at the front of p’, or we have to 
insert n between two entries that form one of the (n- 2)-{k-l) ascents of p’. This 
means that we have n-k choices for the position of n. As we have A(n- 1, k) choices 
for p\ the second part of the right-hand side is explained, and the theorem is 
proved. 
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We note that A(n, k+1 )=A(n, n-k); in other words, the Eulerian numbers are 
symmetric. Indeed, if p=p\p 2 —p„ has k descents, then its reverse p=p„p n -\ ■' m p\ has 
n-k- 1 descents. 

The following theorem shows some additional significance of the Eulerian 
numbers. In fact, the Eulerian numbers are sometimes defined using this 
relation. 

THEOREM 1.8 

SetA( 0, 0)=1, and Ain, 0)=0 forn> 0. Then for all nonnegative integers n, and for all real 
numbers x, we have 

x n = ’pA(n,k)(^ X + n n k y ( 1 . 2 ) 


Example 1.9 

Let n= 3. Then we have A( 3, 1)=1, A( 3, 2)=4, and A( 3, 3)=1, enumerating the 
sets of permutations (123), {132, 213, 231, 312), and (321). And indeed, we 
have 


* 3 = 



□ 


PROOF (of Theorem 1.8) Assume first that x is a positive integer, Then the left- 
hand side counts the w-element sequences in which each digit comes from the set 
[x]. We will show that the right-hand side counts these same sequences. Let 
a^a^M^-a,, be such a sequence. Rearrange the a into a nondecreasing order 
a' = a*, < «j, < - - - < a* , with the extra condition that identical digits 
appear in a' in the increasing order of their indices. Then ^if—f is an n- 
permutation that is uniquely determined by a. Note that z, tells from which position 
of a the first entry of i comes, 4 tells from which position of a the second entry of 
i comes, and so on. 

For instance, if zsf= 3 11243, then the rearranged sequence is a =112334, leading 
to the permutation j=234165. 

If we can show that each permutation i having k- 1 descents is obtained 
from exactly x+n-k sequences a this way, then we will have proved the 
theorem. 

The crucial observation is that if then z,<z} +1 . Taking 

contrapositives, if j is a descent oip(a)=i 1 i 2 . .. 4, then a*, < This means that 
the sequence Thas to be strictly increasing whenever j is a descent oi p(a). The 
reader should verify that in our running example, i has descents at 3 and 5, and 
indeed, a ’ is stricdy increasing in those positions. 


2004 by Chapman & Hall/CRC 


In One Line And Close. Permutations as Linear Orders. 


1 


How many sequences a can lead to the permutation j=234165? It follows 
from the above argument that in sequences with that property, we must have 

l=fl2=l%=«4<«i=fl6<fl5=X, 

as strict inequality is required in the third and fifth positions. The above chain of 
inequalities is obviously equivalent to 

and therefore, the number of such sequences is clearly 



So this is the number of sequences a for which a =234165. Generalizing this 
argument for any n and for permutations i with k- 1 descents, we get that each 
w-permutation with k-\ descents will be obtained from _ 

(* + n — *) sequences. 

If x is not a positive integer, note that the two sides of the equation to be 
proved can both be viewed as polynomials in the variable x. As they agree for 
infinitely many values (the positive integers), they must be identical. | 

Exercise 7 gives a more mechanical proof that simply uses Theorem 1.7. 

COROLLARY 1.10 

For all positive integers n, we have 

e = ±A(n,k)P + k -'). 

k=0 V 7 

PROOF Replace x by -x in the result of Theorem 1.8. We get 
e{-l? = ±A(n,k)(- X + *- k ). 

fc=0 ' 7 

Now note that 

/—x +n — k\ (—x + n — k)(—x + n — k — 1) • • • {—x +1 — fc) 

V, » / ni 

Comparing these two identities yields the desired result. | 
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The obvious question which probably crossed the mind of the reader by now 
is whether there exists an explicitformula for the numbers A(n, k). The answer to 
that question is in the affirmative, though the formula contains a summation 
sign. This formula is more difficult to prove than the previous formulae in this 
Section. 


THEOREM 1.11 

For all nonnegative integers n and k satisfying k<n, we have 



(1.3) 


i=0 


While this theorem is a classic (it is more than a hundred years old), the present 
author could not find an immaculately direct proof for it in the literature. Proofs 
we did find used generating functions, or manipulations of double sums of 
binomial coefficients, or inversion formulae to obtain (1.3). Therefore, we sollicited 
simple, direct proofs at the problem session of the 15th Formal Power Series and 
Algebraic Combinatorics conference, that took place in Vadstena, Sweden. The 
proof we present here was contributed by Richard Stanley. A similar proof was 
proposed by Hugh Thomas. 

PROOF (of Theorem 1.11) Let us write down k-l bars with k compartments in 
between. Place each element of [n] in a compartment. There are k n ways to do 
this, the term in the above sum indexed by i= 0. Arrange the numbers in each 
compartment in increasing order. For example, if k= 4 and w=9, then one 
arrangement is 


237||19|4568. 


(1.4) 


Ignoring the bars we get a permutation (in the above example, it is 237194568) 
with at most k-l descents. 

There are several issues to take care of. There could be empty compartments, 
or there could be neighboring compartments with no descents in between. We 
will show how to sieve out permutations having either of these problems, and 
therefore, less than k-l descents, at the same time. 

Let us say that a bar is extraneous if 

(a) removing it we still get a legal arrangement, that is, an arrangement in 
which each compartment consist of integers in increasing order, and 

(b) it is not immediately followed by another bar. 

For instance, in (1.4), the second bar is extraneous. Our goal is to enumerate the 
arrangements with no extraneous bars, as these are clearly in bijection with 
permutations with k-l descents. 
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In order to do this, we apply the Principle of Inclusion and Exclusion. Let B, 
be the number of arrangements with at least i extraneous bars, and let B be the 
number of arrangements with no extraneous bars. The Principle of Inclusion 
and Exclusion then tells us that 

B=t-B 1 +B 2 -B 3 +-+{-iyB n . (1.5) 

Let us determine If. Arrangements that have at least one extraneous bar can 
be obtained as follows. Write down the elements of [n] with k-2 bars in 
between, forming k-\ compartments. Then insert an extraneous bar to the 
left of one of the n entries, or at the end, in n+ 1 ways. This shows that 

B 1 = ("+ 1 ) • (k - 1)". 

Similarly, we compute that f ?2 = ("J 1 ) ' (& — 2) n . The only change is that 
this time we start with k-3 bars and k-2 compartments, therefore we have to 
insert two extraneous bars at the end. Continuing this line of reasoning,we get 
that 

Bi = C t x ) ■ (k ~ i)n ’ 

and (1.3) is immediately obtained after we substitute the values of B t into 
(1.5). | 

For the sake of completeness, we include a more computational proof that 
does not need a clever idea as the previous one did. 

First, we recall a lemma from the theory of binomial coefficients. 

LEMMA 1.12 

[Cauchy’s Convolution Formula] Let x and y be real numbers, and let z be a positive integer. 
Then we have 



PROOF Assume first that x and y are positive integers. The the left-hand side 
enumerates the z-element subsets of the set [x+y], while the right-hand side 
enumerates these same objects, according to the size of their intersection with 
the set [x]. 

For general x and y, note that both sides can be viewed as polynomials in x 
and y, and they agree for infinitely many values (the positive integers). Therefore, 
they have to be identical. 


PROOF (of Theorem 1.11) As a first step, consider formula (1.2) with x=l, 
then with x=2 , and then for x=i for i<k. We get 


1 = A(n, 1) • 
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2" = A(n, 2) • 
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n + 1\ 


+ A(n, 1) • 


n 


and so on, the Mi equation being 

h -1 

b n - k ~ •?) 

j=o 

and the last equation being 

k l 

k n = Y^A(n,k-j) 

i =o 


n fn + j~ 1 
n 


z\( n + j- 1 

n 


( 1 . 6 ) 


(1.7) 


We will now add certain multiples of our equations to the last one, so that the 
left-hand side becomes the right-hand side of formula (1.3) that we are trying to 
prove. 

To start, let us add (-1) ("+ 1 ) times the (A-l)st equation to the last one. 
Then add ( n +') times the (i-2)nd equation to the last one. Continue this way, 
that is, in step i, add (-1) ("I" 1 ) times the (k-i)xh equation to the last one. This 
gives us 




*=0 


n - 1-1 


( k-i) n = J2 A (n,j)^2 
j= 1 


i =0 


n + k — i 
n 


(-i)h 


( 1 . 8 ) 

The left-hand side of (1.8) agrees with the right-hand side of (1.3). Therefore, 
(1.3) will be proved if we can show that the coefficient a(n,j) of A(n,j) on the 
right-hand side above is 0 for j^k. It is obvious that the a(n, k )=1 as A(n, k) occurs 
in the last equation only. 

Set b=k-j. Then a(n, k) can be transformed as follows. 

^EM>tr)rr> 


Recalling that for positive x, we have ( x \ = ( a;+a 1 )(—l) a , and noting that 
(T)'— (T)^ 2 ', this yields 




where the last step holds as b=k-j> 0, and the next-to-last step is a direct application 
of Cauchy’s convolution formula. 

This shows that the right-hand side of (1.8) simplifies to A(n, k), and proves 
our Theorem. | 
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n=0 




1 


n=l 



0 


1 

n=2 


0 


1 

1 

n=3 


0 

1 


3 1 

n=4 

0 

1 


7 

6 

n—5 

0 

1 

15 


25 10 


FIGURE 1.1 

The values of S(n, k) for n<5. Note that the Northeast-Southwest diagonals contain values 
of S(n, k) for fixed k. Row n starts with S(n, 0). 

1.1.3 Stirling numbers and Eulerian numbers 

A partition of the set [n] into r blocks is a distribution of the elements of [n] 
into r sets B u B 2 , ■■■, B, so that each element is placed into exactly one 
block. 

Example 1.13 

Let w=7 and /=3. Then {1, 2, 4}, {3, 6}, {5}, {7} is a partition of [n] into r 
blocks. Q 

Note that neither the order of blocks nor the order of elements within each block 
matters. That is, (4,1,2}, (6,3}, {5}, {7} and (4,1, 2}, {6,3}, {7}, {5} are considered 
the same partition as the one in example 1.13. 

DEFINlTlOJV 1.14 The number ofpartitions of [n] into r blocks is denoted by S(n, k) and 
is called a Stirling number of the second kind. 

By convention, we set S{n, 0)=0 if n> 0, and 5(0, 0)=1. The next chapter will 
explain what the Stirling numbers of the first kind are. 

Example 1.15 

The set [4] has six partitions into three parts, each consisting of one doubleton 
and two singletons. Therefore, 5(4, 3)=6. Q 

Whereas Stirling numbers of the second kind do not directly count permutations, 
they are inherently related to two different sets of numbers that do. One of them 
is the set of Eulerian numbers, and the other one is the aforementioned set of 
Stirling numbers of the first kind. Therefore, exploring some properties of the 
numbers S(n, k) in this book is well-motivated. See Figure 1.1 for the values of 
S(n, k) for n<5. 

See Exercises 8 and 14 for two simple recurrence relations satisfied by the 
numbers S(n, k). It turns out that an explicit formula for these numbers can be 
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proved without using the recursive formulae. 

LEMMA 1.16 

For all positive integers n and r, we have 

i=Q ' ' 

PROOF Note that an ordered partition of n into r blocks is just the same as 
a surjection from [n] to [r], To enumerate all such surjections, let A, be the set 
of functions from [n] into [r] whose image does not contain i. The function f: 
[ n ]~*[ r ] i s a surjection if and only if it is not contained in Ai U A 2 U A r , and 
our claim follows by a standard application of the Principle of Inclusion- 
Exclusion. | 

Stirling numbers of the second kind and Eulerian numbers are closely related, as 
shown by the following theorem. 

THEOREM 1.17 

For all positive integers n and r, we have 

= (1.9) 

PROOF Multiplying both sides by r! we get 

r\S(n, r) = ^ A(n, k) (*\ _ k \ . 

k =0 ' ' 

Here the left-hand side is obviously the number of ordered partitions, (that is, 
partitions whose set of blocks is totally ordered), of [n] into r blocks. We will now 
show that the right-hand side counts the same objects. Take a permutation p 
counted by A(n, k). The k ascending runs of p then naturally define an ordered 
partition of [n ]into k parts. If k=r, then there is nothing left to do. If hddr, then we 
will split up some of the ascending runs into several blocks of consecutive elements, 
in order to get an ordered partition of rblocks. As we currently have k blocks, we 
have to increase the number of blocks by r-k. This can be achieved by choosing 
r-k of the n-k “gap positions”, (gaps between two consecutive entries within the 
same block). 

This shows that we can obtain ^Cfe=o A(n, &) (”-&) ordered partitions of [n] 
into rblocks by the above procedure. It is straightforward to show that each such 
partition will be obtained exacdy once. Indeed, if we write the elements within 
each block of the partition in increasing order, we can just read the entries of the 
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ordered partition left to right and get the unique permutation having at most r 
ascending runs that led to it. We can then recover the gap positions used. This 
completes the proof. I 


Inverting this result leads to a formula expressing the Eulerian numbers by the 
Stirling numbers of the second kind. 


COROLLARY 1.18 

For all positive integers n and k, we have 

k 

A(n, k) = ^2 S(n,r) 


r =1 




( 1 . 10 ) 


PROOF Let us consider formula (1.9) for each r<k, and multiply each by r\. We 
get the equations 


1! ■ S(n, 1) = A(n,l) 


n — 1 
0 


2! • S(n, 2) = A(n, 1) ~ ^ + A(n, 2) Q 2 

the equation for general r being 

r\ ■ S(n,r) = Y2 A (n,i)( n _ l . \ 

i= l ' r 

and the last equation being 

k 

k\ ■ S(n, k) = A(n, i) 


i =1 


n — i 
r — i 


( 1 . 11 ) 


( 1 . 12 ) 


Our goal is to eliminate each term from the right-hand side of (1.12), except for 
the term A(n,k) (^2*)=A(n,k). We claim that this can be achieved by multiplying 

(1.11) by (-l) fcr (£~J|), doing this for all r £ [k — 1], then adding these equations to 

( 1 . 12 ) . 

To verify our claim, look at the obtained equation 

t so. rH-w?:;)=£<-W:::) t ^ o (:::). 


r =1 


r =1 


i=l 


or, after changing the order of summation, 
t S C* ->!(-!)*- (I A(n, i) (l Z *■) ±(-1) 


(1.13) 


r=1 


i =1 


k—r 


r=l 


n — r 
k — r 


(1.14) 
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whose left-hand side is identical to the right-hand side of (1.10). 

It is obvious that the coefficient of A(n, k) on the right-hand side is =1. 

Therefore, our statement will be proved if we can show that the coefficient t(n, i) 
of A(n, i) in the last expression is equal to zero if z<T. 

Note that (”~?) = 0 if r<i. Therefore, for any fixed i<k, we have 


k 

t(n, i) = YL 

r—i 



k — i — 1 
k — i 



(_l)fc-r 


= 0. 



/'k — n — 1\ 
\ k — r ) 


We used Cauchy’s convolution formula (Lemma 1.12) in the last step. This 
proves that if i<&, then A(n, i) vanishes on the right-hand side of (1.14). We have 
discussed that A(n, k) will have coefficient 1 there, (and this can be seen again by 
setting k=i in the last expression, leading to t(n,i) = ("q 1 ) = 1), so (1.14) implies 
the claim of this Corollary. | 


1.1.4 Generating functions and Eulerian numbers 

The various generating functions of the Eulerian numbers have several interesting 
properties. Let us start with a finite version. 

DEFINITION 1.19 For all nonnegative integers n, the polynomial 

n 

A n (x) = YY A(n, k)x k 

k-1 


is called the nth Eulerian polynomial. 

The Eulerian polynomials have several interesting properties that can be proved 
by purely combinatorial means. We postpone the study of those properties until 
the next Subsection. For now, we will explore the connection between these 
polynomials and some infinite generating functions. 

THEOREM 1.20 

For all positive integers n, tlw nth Eulerian polynomial has the alternative description 

A n (x) = (l-x) n+l Y2i n * i - 

i> 0 

Note that Euler first defined the polynomials A„(x) in the above form. 
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Example 1.21 

For n=l, we have 

A\ (x ) = (1 — x) 2 ^ ix 1 

i> 0 




and for n= 2, we have 
A 2 (x) = (1 — x) 3 ^ i 2 x l = (1 — x) 3 

□ 


*>0 


2x 2 


+ 


(1 — a;) 3 (1 — x) 


x + x . 


PROOF Let us use formula (1.3) to write the Eulerian polynomials as 


A(n,k)x k =J2 E (- 1 )* 


i (n + 1 


k =1 0 <i<k 


(k - i) n x k 


= E (E (-') 

k =1 \0 <i<k 


k -^ n + l \ n x k 
K %) 


Changing the order of summation, and noting that the sum in parentheses, being 
equal to A(n, k), vanishes for k~>n, we get 

E ■ E (l t () (-*)*-* = a - *)” + ' E 


i>0 k>i 


i> 0 


It is often useful to collect all Eulerian numbers A(n, k) for all n and all k in a 
master generating function. This function turns out to have the following simple 
form. 

THEOREM 1.22 

Letr(x, u) = J2 n >o rLk>o 4(n, k)x k Ar ■ Then we have 

, ^ 1 ~t 

rtx.u) = -- 7 Z - 77. 

1 - te u O-t) 

PROOF Using the result of Theorem 1.20, we see that 


<*,„) = E a - *>" +1 E « U = d-») E E 


n> 0 


i> 0 


i>0 n>0 




,iu(l—x) _ 


1-1 


i> 0 


1 - te u O-t)' 
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n=l 



1 



n=2 



1 

1 


n=3 


1 

4 

1 


n=4 

1 


11 

11 

1 

n=5 

1 

26 

66 

26 


n-6 

1 57 


302 

302 

57 


FIGURE 1.2 

Eulerian numbers for n<6. Again, the NE-SW diagonals contain the values of A(n, k) for 
fixed k. Row n starts with A(n, 1). 


■ 

1.1.5 The sequence of Eulerian numbers 

Let us take a look at the numerical values of the Eulerian numbers for small n, 
and ^=0, 1, n- 1. The wth row of Figure 1.2 contains the values of A(n, k), for 
l<k<n, up to n=6. 

We nodce several interesting properties. As we pointed out before, the sequence 
A(n, k) is symmetric for any fixed n. Moreover, it seems that these sequences first 
increase steadily, then decrease steadily. This property is so important in 
combinatorics that it has its own name. 

DEFINITION 1.23 We say that the sequence of positive real numbers a h a 2 , a n is 

unimodal f there exists an index k such that 1 <k<n, and ai<av<aj>a k+ {>a n . 

The sequences A(n, Aj {1 < 4 <„ } seem to be unimodal for any fixed n. In fact, they 
seem to have a stronger property. 

DEFINITION 1.24 We say that the sequence of positive real numbers a h a 2 , —, a„ is log- 
concave fak-icik+i f holdsfor all indices k. 

PROPOSITION 1.25 

If the sequence a u <%, a x fpositive real numbers is log-concave, then it is also unimodal. 

PROOF The reader should find the proof first, then check the proof that we 
provide as a solution for Exercise 5. I 

The conjecture suggested by our observations is in fact correct. This is the content 
of the following theorem. 
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THEOREM 1.26 

For any positive integer n, the sequence A(n, k)[i£*£ n ) of Eulerian number is log- 
concave. 

While this result has been known for a long time, it was usually shown as a 
corollary to a stronger, analytical result that we will discuss shordy, in Theorem 
1.33. Direct combinatorial proofs of this fact are more recent. The proof we 
present here was obtained by Bona and Ehrenborg [28] who used an idea of 
Gasharov. 

If a path on a square grid uses steps (1, 0) and (0, 1) only, we will call it a 
northeastern lattice path. 

Before proving the theorem, we need to set up some tools, which will be 
useful in the next section as well. We will construct a bijection from the set A(n, 
k) of w-permutations with k descents onto that of labeled northeastern lattice 
paths with n edges, exactly k of which are vertical. (Note the shift in parameters: 

| A(n, k)\=A{n, k +\), but this will not cause any confusion.) 

Let V{n) be the set of labeled northeastern lattice paths that have edges a h itr h 
..., a„ and that corresponding positive integers e h e 2 , ..., e n as labels, so that the 
following hold: 

(i) the edge a± is horizontal and e t =l, 

(ii) if the edges a,- and a, + 1 are both vertical, or both horizontal, then 
e,>e l+u 

(iii) if a, and a i+1 are perpendicular to each other, then e,Te M <z+L 

At this point, the starting point of a path in V(n) has no additional significance. 
Let 'Pi n. k) be the set of all lattice paths in Pin) which have k vertical edges, and 
let P( n ,fc) = \P(n,k)\- 

PROPOSITION 1.27 

The following two properties of paths in P(n) are immediate from the definitions. 

• For all i>2, we have e<z-l. 

• Fix the label e r If e M can take value t), then it can take all positive integer values 
zv<V. 

Also note that all restrictions on <?,+, are given by e„ independently of preceding e p 
j<i. Now we are going to explain how we will encode our permutations by these 
labeled lattice paths. 

LEMMA 1.28 

The following description defines a bijection from A(n) onto P(n), where A(n) is the set of 
all n-permutations. Let p g ,4(n). To obtain the edge a,- and the label eifor 2<i<n, 
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restrict the permutation p to the i first entries and relabel the entries to obtain a permutation 
q=qvq< of [if. 

• If the position i-\ is a descent oft!w permutation p (equivalently, ofthe permutation q), let 
the edge a, be vertical and t!w label e t be equal to q t . 

• If the position i -1 is an ascent of the permutation p, let the edge a , be horizontal and the 
label <?,- be i+l-q,. 

Moreover, this bijection restricts naturally to a bijection between A(n, k) and V(n , k)for 
§<k<n-\. 


PROOF 

The described map is clearly injective. Assume that i-1 and i are both descents of 
the permutation p. Let q, respectively r, be the permutation when restricted to 
the i, respectively z'+l, first elements. Observe that y is either r, or r,-l. Since 
rfr l+x we have q>r i+1 and condition (ii) is satisfied in this case. By similar reasoning 
the three remaining cases (based on zT and z being ascents or descents) are shown, 
hence the map is into the set P(n). 

To see that this is a bijection, we show that we can recover the permutation p 
from its image. To that end, it is sufficient to show that we can recover p n , and 
then use induction on n for the rest of p. To recover p„ from its image, simply 
recall that p n is equal to the label l of the last edge if that edge is vertical, and to 
n+l-l if that edge is horizontal. Conditions (ii) and (iii) assure that this way we 
always get a number between 1 and n for p n . | 

See Figure 1.3 for an example of this bijection. 

Now we are in position to prove that the Eulerian numbers are log-concave. 

PROOF (of Theorem 1.26). We construct an injection 

$ : 'P(n, k— 1) x V(n, k + 1) —> V{n, k) x V(n, k). 

This injection <!> will be defined differently on different parts of the domain. 

Let (P, Q ) £ V(n, k — 1 ) x V(n, k + 1 ) Place the initial points of P and 
Q_at (0, 0) and (1, -1), respectively. Then the endpoints of Pand Q^are (n- 
k+\, k-1) and (n-k, k), respectively, so Pand Qjntersect. Let Xbe their first 
intersection point (we order intersection points from southwest to 
northeast), and decompose P = Pi U P% and Q = Qi U Q 2 , where Pi is a 
path from (0, 0) to X, P 2 is a path from X to (n-k, k), Qj, is a path from (1, 
-1) to X, and Qj, is a path from X to (n-k+ 1, PI). Let a, b, c, d be the labels 
of the four edges adjacent to X as shown in Figure 1.5, the edges AX and 
XB originally belonging to P and the edges CX and XD originally 
belonging to £1 Then by condition (ii) we have a>b and c>d. (It is possible 
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FIGURE 1.3 

The image of the permutation 243165. 


that these four edges are not all distinct; A and C are always distinct as X is the 
first intersection point, but it could be, that B=D and so BX=DX; this singular 
case can be treated very similarly to the generic case we describe below and is 
hence omitted). Let P 1 — P 1 \J Q 2 and let Q 1 - Qi A IP 

• If P'and Qjare valid paths, that is, if their labeling fulfills conditions (i)-(iii), 
then we set Q>(P, Q) =(P’, QJ. See Figure 1.4 for this construction. This way 
we have defined ® for pairs (P, Q) g V(n, k) x V(n, k) in which a+d<i 
and b+c<i, where i-\ is the sum of the two coordinates of X. We also point 
out that we have not changed any labels, therefore in (P’, QJ we still have 
a>h and c>d, though it is no longer required as the edges in question are no 
longer parts of the same path. 

It is clear that $(P, Q) = (P',Q') e V(n,k) x V(n,k ), (in particular, 
(P’, QJ belongs to the subset of V{n. k) x V{n. k) consisting of intersecting 
pairs of paths), and that <J> is one-to-one. 

• What remains to be done is to define O (P, QJ_ for those (P, Q) e V(n, k — 
1 ) x P{n,k+ 1 ) for which it cannot be defined this way, that is, when either 
a+cL>i or b+c~>i holds. 

Change the label of the edge AX to i-c and change the label of the 
edge CX to i-a as seen in Figure 1.6, then proceed as in the previous 
case to get <J>(2j Q}=(P’, QJ, where P' = Pj U Q 2 and Q 1 - Q 1 u P 2 . 
We claim that P’ and Qj are valid paths. Indeed we had at least one 
of a+di>i and b+cl>i, so we must have a+ci>i as a>b and c>d. 
Therefore, i-aXc and i-cXa, so we have decreased the values of the 
labels of edges AX and CX, and that is always possible as shown in 
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1 - 1 


1 1 


1 1 





FIGURE 1.4 

The new pair of paths. 



FIGURE 1.5 

Labels around the point X. 
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FIGURE 1.6 

New labels around the point X. 


Proposition 1.27. Moreover, no constraints are violated in P’ and Cf by the 
edges adjacent toXas i-c+d< i and i-a+b<i. It is also clear that O is one-to-one 
on this part of the domain, too. Finally, we have to show that the image of 
this part of the domain is disjoint from that of the previous part. This is true 
because in this part of the domain we have at least one of a+d>i and b+ct>i, 
that is, at least one of i-t.Kb and i-a<A, so in the image, at least one of the pairs 
of edges AX, XB and CX, XD does not have the property that the label of 
the first edge is at least as large as that of the second one. And, as pointed 
out in the previous case, all elements of the image of the previous part of the 
domain do have that property. 

The map <J> we created is an injection. This shows that A(n, k-\)A(n, k+\)<A(n, 
k) 2 , so our theorem is proved. | 

There is a property of sequences of positive real numbers that is even stronger 
than log-concavity. 

DEFINITION 1 .29 Let «!, a 2 > •" be a sequence of positive real numbers. We say that 
this sequence has real roots only or real zeros only f the polynomial y)., cnx 1 has 
real roots only. 

We note that sometimes the sequence can be denoted a,, «i, ■■■, a m and sometimes 
it is better to look at the polynomial ff =() aix 1 (which, of course, has real roots 
if and only if y”_ 0 a, x t+1 does). 

Example 1.30 

For all positive integers n, the sequence Oq, a u «„ defined by a* = (") has real 
zeros only. 
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PROOF We have 'ffJi=a a i xi = Ylif o (l) 3 '* = (1 + Jc) w , so all roots of our 
polynomial are equal to -1. 

Having real zeros is a stronger property than being log-concave, as is shown by 
the following theorem of Newton. 

THEOREM 1.31 

If a sequence of positive real numbers has real roots only, then it is log-concave. 

PROOF Let oo, 0 i, ■■■, a k be our sequence, and let P{x) — Sfc=o ■ 
Then for all roots (x, y) of the polynomial Q(x,y) = t ^ ie 

ratio (x/y) must be real. (Otherwise x/y would be a non-real root of P(x)). 
Therefore, by Rolle’s theorem, this also holds for the partial derivatives 
dQ/dx and dQ/dy. Iterating this argument, we see that the polynomial 
d“ +b Q/dx“dy b also has real zeros, if a+b<n- 1. In particular, this is true in the 
special case when a—j- 1, and b=n-j- 1, for some fixed j. This implies that 
the quadratic polynomial R(x,y) = d n ~ 2 Q / dx 3 ~ l dy n ~ 3 ~ x has real roots 
only, and therefore the discriminant of R(x, y) is non-negative. On the 
other hand, we can compute R(x, y) by computing the relevant partial 
derivatives. Note that we only have to look at the values of k ranging 
from j -1 to j +1 as all other summands of Qfx, y) vanish after derivation. 
We get 

R{x,y) = a^i-(j-iy.^{n-j + l)\y 2 +a j jl(n-j)lxy+a j+ i{n-j~l)\^(j + l)\ 

As we said, this polynomial has to have a non-negative discriminant, meaning 
that 

9 7 + 1 n — 7 + 1 

+ P : ’ : + ’l+y,i - (1.15) 

1 J n- j v ' 

which is stronger than our original claim, a 2 > aj—iUj+i. I 

The alert reader has probably noticed that by (1.15), a log-concave sequence, 
does not necessarily have real zeros only. For instance, the sequence 1, 1, 1 is 
certainly log-concave, but l+x+x 2 has two complex roots. 

One might ask why we would want to know whether a combinatorially defined 
sequence has real zeros or not. In certain cases, proving the real zeros property is 
the only, or the easiest, way to prove log-concavity and unimodality. In some 
cases, unimodality and log-concavity can be proved by other means, but that 
does not always tell us where the maximum or maxima of a given sequence is, or 
just how many maxima the sequence has. Note that a constant sequence is always 
log-concave, so a log-concave sequence could possibly have any number of 
maxima. The following Proposition shows that in a sequence with real zeros 
only, the situation is much simpler. 
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PROPOSITION 1.32 

If the sequence {a t } 0 < k <„ has real zeros only, then it has either one or two maximal 
elements. 

PROOF Formula (1.15) shows that in such a sequence, the ratio «,+, /«, strictly 
decreases, so it can be equal to 1 for at most one index j. | 

Theorem 3.25 will show how to find the maximum (or maxima) of a sequence 
with real zeros. 

The following theorem shows that Eulerian numbers have this last, stronger 
property as well. 

THEOREM 1.33 

For any fixed n, the sequence [A (n, k) } /t of Eulerian numbers has real roots only. In other 
words, all roots of the polynomial 

n 

A n (x) = ^ A(n, k)x k 

k=l 


are real. 

Recall that the polynomials Afx) of Theorem 1.33 are called the Eulerian polynomials. 
This theorem is a classic result, but surprisingly, it is not easy to find a full, self- 
contained proof for it in the literature. The ideas of the proof we present here are 
due to Herb Wilf and Aaron Robertson. 

PROOF Theorem 1.7 implies 

Afx) = (x - x 2 )A' n _ 1 (x) + nxA„-i(x) (n > 1; A 0 {x) = x). 

Indeed, the coefficient of x t on the left-hand side is A(n, k), while the coefficient of 
x' ! on the right-hand side is 

kA(n-l, k)-(k-l)A(n-l, k-l)+nA(n-l, k-V)= 

kA{n-\, k) +(n-k+\)A(n-\, k-l)=A(n, k). 

Now note that the right-hand side closely resembles the derivative of a product. 
This suggests the following rearrangement: 

Afx) =x(l ~ x ) n+1 -f^ K 1 ~x)~ n A n - i(ar)} (1.16) 

with ri>\ and A 0 (X)=x. 

The Eulerian polynomial A 0 (X)=x vanishes only at x=0. Suppose, 
inductively, that A n .i(x) has n -1 distinct real zeros, one at x=0, and the others 
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negative. From (1.16), or otherwise, A n (x) vanishes at the origin. Further, by 
Rolle’s theorem, (1.16) shows that A„(x) has a root between each pair of 
consecutive roots of A„. 1 (x). This accounts for n- 1 of the roots of g„(x). Since 
we have accounted for all but one root, the remaining last root must be real 
since complex roots of polynomials with real coefficients come in conjugate 
pairs. | 

Eulerian numbers can count permutations according to properties other than 
descents. Let p=p 1 p 2 ---p„ be a permutation. We say that zis an excedance of p 
if /,-X 


Example 1.34 

The permutation 24351 has three excedances, 1, 2, and 4. Indeed, /]=2>1, 
p 2 = 4>2, and / 4 =5>4. D 

THEOREM 1.35 

The number of n-permutations with k-1 excedances is A (n, k). 

We postpone the proof of this theorem until Section 3.3.2, where it will become 
surprisingly easy, due to a different way of looking at permutations. We mention, 
however, that if f: S n — >N is a function associating natural numbers to 
permutations, then it is often called & permutation statistic. If a permutation statistic 
/’has the same distribution as the statistic “number of descents”, that is, if for 
all k G [n], we have 

\{p G S n \f(p) = k} = \{p G S n \d(p) = k}\, (1.17) 

then we say that f is an Eulerian statistic. So Theorem 1.35 says that “number of 
excedances,” sometimes denoted by exc, is an Eulerian statistic. We will see further 
Eulerian statistics in the Exercises section. 


1.2 Alternating runs 

Let us modify the notion of ascending runs that we discussed in the last section. 
Let p=pipr"p n be a permutation. We say that / changes direction at position z if 
either P t \ </>/’+!, or P f] . In other words, p changes directions when/, is 
either a peak or a valley. 

DEFINITION 1.36 We say that p has k alternating runs if there are k-\ indices i so 
that p changes direction at these positions. 


2004 by Chapman & Hall/CRC 



In One Line And Close. Permutations as Linear Orders. 


25 



FIGURE 1.7 

Permutation 3561247 has three alternating runs. 


For example, p =3561247 has 3 alternating runs as p changes direction when 
i=3 and when f=4. A geometric way to represent a permutation and its alternating 
runs by a diagram is shown in Figure 1.7. The alternating runs are the line 
segments (or edges) between two consecutive entries where p changes direction. 
So a permutation has k alternating runs if it can be represented by k line segments 
so that the segments go “up” and “down” exactly when the entries of the 
permutation do. 

The origins of this line of work go back to the nineteenth century. More 
recently, D.E.Knuth [136] has discussed the topic in connection to sorting and 
searching. 

Let G(n,k) denote the number of w-permutations having k alternating runs. 
There are significant similarities between these numbers and the Eulerian 
numbers. For instance, for fixed n, both sequences have real zeros only, and both 
satisfy similar recurrence relations. However, the sequence of the G(n, k) is not 
symmetric. On the other hand, almost half of all roots of the generating function 
G n {x) = '^ /pe s n x rl ' p '> = '^2 k>1 G(n , k)x k are equal to T. Here r(p) denotes the 
number of alternating runs of p. 

First we prove a simple recurrence relation on the numbers G(n, k), which 
was first proved by Andre in 1883. 
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2 12 10 

2 28 58 32 

2 60 236 300 122 


FIGURE 1.8 

The values of G(n, k) for n< 6. The first value of row n is G(n, 1). The NE-SW diagonals 
contain the values of G(n, k) for fixed k. 

LEMMA 1.37 

For positive integers n and k we have 

G(n, k)-kG(n- 1, k)+2G{n-l, k-l)+(n-k)G{n-l, k-2), (1.18) 

where we set G(l, 0)=1, and G(l, A)=0 for £>0. 

PROOF Let p be an {n-1 )-permutation having k alternating runs, and let us try 
to insert n into p without increasing the number of alternating runs. We can 
achieve that by inserting n at one of k positions. These positions are right before 
the beginning of each descending run, and right after the end of each ascending 
run. This gives us kG{n-l, k) possibilities. 

Now let q be an {n-1 )-permutation having k T alternating runs. We want 
to insert n into q so that it increases the number of alternating runs by 1. We 
can achieve this by inserting n into one of two positions. These two positions 
are very close to the beginning and the end of q. Namely, if q starts in an 
ascending run, then insert n to the front of q, and if q start in a descending 
run, then insert n right after the first entry of q. Proceed dually at the end of 
the permutation. 

Finally, let r be an (n-1)-permutation having k-2 alternating runs, and 
observe that by inserting n into any of the remaining n-{k-2)-2=n-k positions, 
we increase the number of alternating runs by two. This completes the 
proof. I 

The first values of G(n, k) are shown in Figure 1.8 for n<6. 

Looking at these values of G(n, k), we note they are all even. This is easy 
to explain as p and its reverse always have the same number of alternating 
runs. 

Taking a second look at the polynomials G n (x), we note that Gfx)={x+1) 
(10x 2 +2x), and that G5(x)=32x 4 +58x 3 +28x 2 +2x=(x+l) (32x 3 +26x 2 +2x). Further 
analysis shows that GfX) and G ? (x) are divisible by (x+1) 2 , and that G s (x) and 
Gg(x) are divisible by (x+1) 3 , and so on. In general, it seems that for any positive 
integer ?z>4, the polynomial G n (x) is divisible by (x+l) K ”' 2>/2) . 
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This is an interesting observation, and one which is certainly of combinatorial 
flavor. For instance, if we just wanted to prove that G„(x) is divisible by x+1, we 
could proceed as follows. We could arrange our permutations into pairs, so 
that each pair consists of two permutations, one with r alternating runs, and 
one with r+1 alternating runs. If we could do that, that would imply that 
G„(x)=(l+x)F„(x). Here F„(x) is the generating function by the number of 
alternating runs for the set of permutations that consists of one element from 
each pair, the one with the smaller number of alternating runs. If we can 
appropriately “iterate” this argument, then we will succeed in proving that 
Gjx) is divisible by a power of (1+x). 

Before we start proving the claim that T is a root of G n (x) with a high 
multiplicity, we point out that one might also wonder whether the polynomials 
G n (x)/2(x+iy have some natural combinatorial interpretation for each index 
j < \(n — 2)/2J. Our proof provides such an interpretation. In order to give 
that proof, we need the following definitions that were first introduced in 


[28]. 


DEFINITION 1.38 For j < m = |_(n — 2)/2j, we say that p is aj-half- 
ascending permutation if, for all positive integers i<j, we have p n +i. 2 i < fn+ 2 - 2 i- fj~ m > 
then we will simply say thatp is a half-ascending permutation. 

So p is a 1-half-ascending permutation if p n .i<p n . In a y-half-ascending 
permutation, we have j constraints, and they involve the rightmost j disjoint 
pairs of entries. We call these permutations half-ascending because at least 
half of the involved positions are ascents. 

Now we define a modified version of the polynomials G n (x) for /half-ascending 
permutations. As we will see, one of these polynomials will provide the desired 
combinatorial interpretation for G n (x)/(l+x) m . 

DEFINITION 1.39 Let p be a (j-\-\)-haf-ascendingpermutation. Letrj(p) be the number 
of alternating runs of the substringp h p 2 , ..., p n . 2 j , and let Sj(p) be the number of descents of the 
substring p n _ 2 j, p n+l _ 2 j, ..., pn. Denote tj(p)=rj(p)+Sj(p), and define 



pES, 


So in other words, G„. enumerates the alternating runs in the non-half-ascending 
part and the first two elements of the half-ascending part, and we count the 
descents in the rest of the half-ascending part. 
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LEMMA 1.40 

For all ri>4 and l<j<m, we have 


G n (x) 

2(x + 1)1 


Gn,j (^)> 


where m = [(n - 2)/2J- 

PROOF We prove the statement by induction on j. Let j= 1. Clearly, we can 
restrict our attention to the set of permutations in which />„,•.</v 2 . Indeed, if p 
does not satisfy that condition, then its complement pf will, and vice versa, (where 
pf is the w-permutation whose zth entry is n+l-pf and/> and pf- certainly have the 
same number of alternating runs. 

Let /be the involution acting on the set of all w-permutations (with p n .i<p n f) 
that swaps the last two entries of each permutation. For instance, 
/(5613427)=5613472. It is then straightforward to verify that /either increases 
the number of alternating runs by one, or it decreases it by one. Therefore, / is 
just the involution we were looking for. Indeed, we have 

\G n {x) = xT(p) + xAp)+1 = (® + - 

P(p) P(p) 

where P ranges through all d/4 pairs created by the involution I, and P(p) is the 
permutation in /’that has the smaller number of alternating runs. By verifying all 
(essentially, two, see the example below) possible cases, we see that for all these 
d/4 permutations p, the following occurs. The number r(p<) equals the number 
t\ (cf) of the permutation P(q) in the pair /That is in the same pair as p and ends in 
an ascent. Therefore, the last equality implies 

\G n (x) = ^2(x + l)x tlM = (x + l)G n< i(x), 

P(q) 

where P again ranges the d/4 pairs created by I. Therefore, the initial case is 
proved. 

Figure 1.9 shows the twelve 4-permutations for which ps<p 2 holds, in pairs 
formed by /. The values r(p) and /, (p) are shown as well. One then verifies 
that in each of these pairs, the permutation with the smaller number of 
alternating runs has a number of alternating runs equal to the t\ (p)-v alue of 
the element of that pair in which p^G-ps- This argument carries over for d>4, 
too, for it is only the last four elements where the number of alternating runs 
can be affected by /. 

Now suppose we know the statement for /I and prove it for j. Apply /to 
the two rightmost entries of our permutations to get pairs as in the initial 
case, and apply the induction hypothesis to the leftmost n-2 elements. By the 
induction hypothesis, the string of the leftmost n-2 elements can be replaced 
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1234 

1243 

r(p)=l t,(p)=l 

r(p)=2 

1324 

1342 

r(p)=3 t,(p)=2 

r(p)=2 

1423 

1432 

r(p)=3 tj(p)=2 

r(p)=2 

2314 

2341 

r(p)=3 tj(p)=2 

r(p)=2 

2413 

2431 

r(p)=3 t 1 (p)=2 

r(p)=2 

3412 

3421 

r(p)=3 tj(p)=2 

r(p)=2 


FIGURE 1.9 

The values of r(p) and tf) for rc= 4. 


by a /half-ascending (w-2) -permutation, and the number of runs can be replaced 
by the ^-parameter. In particular, p„-?Tfi„--i will hold, and therefore we can verify 
that our statement holds in both cases {pn'fCpns or pfPpf) exacdy as we did in 
the proof of the initial case. | 

So almost half of the roots of G„(x) are equal to -1, in particular, they are real 
numbers. This raises the question whether the other half are real numbers as 
well. That question has recently been answered in the affirmativ by Herb Wilf 
[201]. In his proof, he used the rather close connections between Eulerian 
polynomials, and the generating functions G n (x) = G(n, k)x k . This 

connection, established in [67], and given in a more concise form in [136] can be 
described by 

= ,U9) 
j J. • 

where w = y i+j-The proof of (1.19) uses the similarities between the recursive 
formulae for A n (x) and G n (x) to get a differential equation satisfied by certain 
generating functions in two variable. The details can be found in [67], pages 
157-162. 

THEOREM 1.41 

(H. Wilf[201].) For anyfixed n, the polynomial G n (x) has real roots only. 

PROOF From (1.19) it follows that G n (x) can vanish only if either x=-l or 
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x=2yl (1 +f), where y is a zero of A n . Indeed, if y is a root of A n (x) and y — jff ) 
then w = . Therefore, 

I 4 ~ii ' 


l-x _ l-y 
V 1 + x 1 + y‘ 

Squaring both sides and solving for x, we get our claim. As we know that the 
roots of A n are real, our statement is proved. I 

It is possible to continue our argument involving half-ascending permutations to 
give a fully combinatorial proof of the weaker statement that G n (x) is always log- 
concave. We know from Lemma 1.40 that 

G n (x) = (l+x) m G n , m (x). (1.20) 

As the polynomial (l+x)"' is obviously log-concave, and the product of log-concave 
polynomials is log-concave (see Exercise 22), the log-concavity of G„(x) will be 
proved if we can prove the following Lemma. 

LEMMA 1.42 

For all integers n> 4, the polynomial G„ im (x) has log-concave coefficients. 

We prove the lemma for even values of n. See Exercise 27 for the necessary 
modifications for odd n. The following Proposition is obvious. 

PROPOSITION 1.43 

Let n be an even positive integer. Let p be a half ascending n-permutation. Then p 
has 2k+\ runs if and only if p has k descents, or, in other words, when 
t(p) = k- t-1. 


PROOF (of Lemma 1.42). The reader is asked to review the proof of Theorem 
1.26. In that proof, the log-concavity of the Eulerian numbers was established 
by an injective map <I>. This map ® acted on pairs of lattice paths, that 
corresponded to pairs of permutations. Now note that in that lattice path 
representation of permutations, half-ascending permutations correspond to 
lattice paths in which all even-indexed steps are horizontal. Observe that O 
preserves this property, that is, the restriction of ® to the set of pairs of haf- 
ascending permutations in A(n, k-1 )xA(n, k+1) is an injection into the set of pairs 
of haf-ascendingperrcmtaxions in A(n, k) xA (n, k). This, together with Proposition 
1.43, proves our claim. I 
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Exercises 

1. Simplify the formula obtained for Cx(S) in Lemma 1.3. 

2. Letp+l be a prime. What can be said about A(p, k) modulo p+ 1? 

3. Find an alternative proof for the fact that A(n, k+\)=A(n, n-k). 

4. What is the value of A* (1)? Here A„(x) denotes the nth Eulerian 
polynomial. 

5. Prove Proposition 1.25. 

6. We have n boxes numbered from 1 to n. We run an w-step experiment as 
follows. In step i, we drop one ball into a box, chosen randomly from 
boxes labeled 1 through i. So during the entire experiment, ("J 1 ) balls 
will be dropped. Let B(n, k) be the number of experiments in which at 
the end, k-\ boxes are left empty. Prove that B(n, k)=A(n, k). 

7. Deduce Theorem 1.8 from Theorem 1.7. 

8. Prove that for all positive integers k<n, we have 

S(n, k)=S(n-l, k-l)+kS(nrl, k). 


9. Prove that 


A(n,k) 


22 (~v h ~ k+1 

h—k—l 


h 

k-1 


S(n,n — h) ■ (n — h)\. 


10. Let p=pip 2 ■••p n be a permutation, and let b t be the number of indices jKi so 
that Pppi■ Find a formula for the number C(n, k) of arrays (bi, b 2 , —, b n ) 
obtained this way in which exacdy k different integers occur. Note that the 
permutation statistic defined as above is often called the Dumont statistic, 
and its value on the permutation p is denoted by dmc(p). 

11. Prove that 

n 

A n (x) = x klS(n, k)(x — \) n ~ k . 

k =1 

12. Prove that 

k 

22 A(n, k) < k n . 

i= 1 
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13. We say that i is a weak excedance oi p=pipvp n if p^-i- Assuming Theorem 
1.35, prove that the number of w-permutations with k weak excedances 
is A(n, k). 

14. Prove that for all positive integers n, we have 

S(n t l.k t 1) = ^2 ( )s( m > k). 

m~k ' m ' 

15. An n-permutation is called alternating if it has descent set (1, 3, 5, •••}. So 
3142 and 52413 are both alternating permutations. Let E n be the number 
of alternating w-permutations. 

a Find a recursive formula for E n+] . 

b Find the exponential generating function E(x) = ^Z n>1 E n W. 

16. Find a combinatorial proof for Corollary 1.18. 

17. Let r be a positive integer, and let i £ \n— 1] be an r-descent of the 
permutation p=pipr--p„ if p(i)=p(i +l)+r. Let A(n, k, r) denote the number 
of all w-permutations with k-1 such r-descents. The numbers A(n, k, r) are 
called the r-Eulerian numbers. Prove that 

A(n, k, r)={k+r-\)A(n-\, k, r) + (n+2-k-r)A(n-l, Cl, r). 

18. Let A(n, k, r) be defined as in the previous Exercise. Prove that 

19. Are the r-Eulerian numbers, defined in Exercise 17 and the /-Takacs-Eulcrian 
numbers, defined in Problem Plus 3 identical? (Try to give a very short 
solution.) 

20. Let k be a fixed positive integer. Find the ordinary generating function 
F (x) = J2 n > k S{n } k)x n . 

21. Prove that for all positive integers n, we have 

n 

X n = S(n,m)(x) m . (1.21) 

m =0 

Recall that (x) m =x(x-l)‘-(x-m+l). 

22. Let P(x) and Qfx) be two polynomials with log-concave and positive 
coefficients. Prove that the polynomial P(x) Qfx) also has log-concave 
coefficients. 
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23. Is it true that if P(x) and Q[x) are two polynomials with unimodal coefficients, 
then P(x)Q[x) also has unimodal coefficients? 

24. Is it true that if P(x) and Q[x) are two polynomials with symmetric and 
unimodal coefficients, then P(x)Q^x) also has symmetric and unimodal 
coefficients? 

25. (a) Find an explicit formula for G(n, 2). 

(b) Find an explicit formula for G(n, 3). 

Here G(n, k) is the number of w-permutations with k alternating runs. 

26. Let n be a fixed positive integer. For what pairs (k, m) does there exist an n- 
permutation with k descents and m alternating runs? 

27. Prove Lemma 1.42 for odd values of n. 

28. Prove that for n sufficiently large, we have (k-\)"KG(n, k)<k n , for all 
k>2. 

29. A sequence f: N—>C is called Precursive if there exist polynomials 
Pa,Pir“>Pk <E Q [n], with P k J=0 so that 

Pk(n+k)f(n+k)+P kl (n+k-l)f(n+k-l)+-+P 0 (n)f(n)=0 (1.22) 

for all natural numbers n. Here /'’-recursive stands for “polynomially 
recursive”. For instance, the function^defined by f(n) =n\ is P-recursive as 
f(n+l)-(n+l)f(n)=0. 

Prove that for any fixed k, A(n, k) is a Precursive function of n. 

30. Prove that for any fixed k, the function S(n, k) is a polynomially recursive 
function of n. 

31. A decreasing binary tree is a rooted binary plane tree that has vertex set [n] 
and root n, and in which each vertex has 0, 1, or 2 children, and each 
child is smaller than its parent. Prove that the number of decreasing binary 
trees is n\. 

32. Prove that the number of decreasing binary trees on [n] in which k-\ vertices 
have a left child is A(n, k). 

33. Let 2<i<n-\. We say that/>,-is a peak of the permutation p= p\pr--p„ if p t 
is larger than both of its neighbors, that is pi.\<pi and pl>p l+1 . Let ri> 4, 
and let k>9. Find a formula for the number Peak(n, k) of w-permutations 
having exactly k peaks. 

34. How many decreasing binary trees are there on n vertices in which exactly 
one vertex has two children? 
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1 



The minimax tree of^=5613724. 


35. Use decreasing binary trees to prove that for any fixed n, the sequence of 

Eulerian numbers { A(n, is symmetric and unimodal. 

36. Which is the stronger requirement for two permutations, to have the same 
set of descents, or to have decreasing binary trees that are identical as 
unlabeled trees? 

37. The minmax tree of a permutation pip 2 —p„ is defined as follows. Let p=umv 
where m is the leftmost of the minimum and maximum letters of p, u is the 
subword preceding m and "U is the subword following m. The minmax tree 
T™ has m as its root. The right subtree of T™ is obtained by applying the 
definition recursively to V. Similarly, the left subtree of T’p is obtained by 
applying the definition recursively to u. See Figure 1.10 for an example. 

(a) Prove that if 1< i<n-2, then there are nl/3 permutations p so that p { is 
a leaf in T™. 

(b) How many w-permutations p are there so that (resp. p^j is a leaf 
in T p m ? 

38. +Let p=p\pr--pn be a permutation, and let x 6 [«]. Define the x- 
factorization of p into the set of strings uk(x)xy(x)v as follows. The string 
P(x) is the longest string of consecutive entries that are larger than x 
and are immediately on the left of x, and the string y(x) is the longest 
string of of consecutive entries that are larger than x and are immediately 
on the right of x. Finally, u(x) and v(x) are the leftover strings at the 
beginning and end of p. Note that each of y(x), u, and x> can be 
empty. 
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FIGURE 1.11 

The labeled grid for £=4 and rc=8. 


For instance, if p=3 1478526 and x=4, then k= 31, A(a;) = 0, y(x)=785, 
and 0 = 26 . 

The notion of Andre permutations proved to be useful in various areas of 
algebraic combinatorics. We say that p is an Andre permutation of the first 
kind if 

(a) There is no i so that pfipn.C>P M , and 

(b) y{x) = 0 =i> X(x) — 0, and 

(c) if y(x) and X(x) are both nonempty, then max k(x)<max y(x). 

Prove that p is an Andre permutation of the first kind if and only if all non¬ 
leaf nodes of the minmax tree T™ are chosen because they are minimum 
(and not maximum) nodes. 

39. Attach labels to the edges of a kx (n-k+ 1) square grid of points as shown in 
Figure 1.11. That is, both the edges of column i and row i get label i. 
Take a northeastern lattice path s from the southwest corner to the northeast 
corner of the grid. This path s will consist of n -1 steps. Define the weight P s 
of s as the product of the labels of all edges of s. Prove that 

A(n,fc) = ^P s , 

8 

where the sum is taken over all (”“|) northeastern lattice paths s from the 
southwest corner to the northeast corner. 

40. Prove, preferably by a combinatorial argument, that if k<fn-l)l2, then we 
have G(n, k)=G(n, k+1). (Note that the sequence is growing even further 
than that, but we do not yet have the methods to prove it.) 
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41. Let r be a positive integer, and modify the labeling of the vertical edges in 
the previous exercise so that the label of the edges in column i is i+r-l 
instead of i. Prove that 


A(n,k,r ) = r!]TP s , 

s,r 


where A (n, k, r) is an r-Eulerian number as defined in Exercise 17, the 
weight P s of a path s is still the product of the labels of its edges, and 
the sum is taken on all (”~jj northeastern lattice paths from ( 0 , 0 ) to 
(LI, n-r-k). 


Problems Plus 

1. A simplicial complex is a collection A of subsets of a given set with the property 
that if E E A, and F C E. then F € A. The sets that belong to the collection 
A are called the faces of A. If 5 £ /\ has i elements, then we call 5 an (z-1)- 
dimensional face. The dimension of A is, by definition, the dimension of 
its maximal faces. 

Prove that there exists a simplicial complex A whose set of (z-1)-dimensional 
faces is in natural bijection with the set of //-permutations having exaedy 
z -1 descents. 

2 . (a) Let The a rooted tree with root 0 and non-root vertex set [n]. Define 

a vertex of The a descent if it is greater than at least one of its children. 
Prove that the number of forests of rooted trees on a given vertex set 
with zTl leaves and j descents is the same as the number of forests of 
rooted trees with j +1 leaves and i descents. 

(b) Why is the above notion of descents a generalization of the notion of 
descents in permutations? 

3. Define the /-Stirling numbers of the second kind by the recurrence 

5(zz+l, k, l)=S{n, LI, l)+k!S(n, k, l), 

and the initial conditions 5(0, 0, /)=1, S(n, 0, l)= 0 for m> 1, and 5(0, k, 
l}= 0 for k>\. Note that for /= 1 , these are just the Stirling numbers of 
the second kind as shown in Exercise 8 . Define the /-Takacs-Eulerian 
numbers by 


A t (n,k,l)= (-!) r fe+1 f fc ^ 1 )5'(n,n-r,/)[(n-r)!]'. 
r=k -1 ^ 
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FIGURE 1.12 

The regions for n =2 

Note that in the special case of /=1, we get the Eulerian numbers, as 
shown in Exercise 9. Prove that these numbers generalize Eulerian 
numbers in the following sense. Modify the experiment of Exercise 6 so 
that in each step, / balls are distributed, independendy from each other. 
Prove that j s th e probability that after n steps, exactly k boxes 

remain empty. 

4. Let k—n be fixed positive integers. Compute the volume of the region R n j< 
of the hypercube [0, 1]“ contained between the two hyperplanes 

Xi = k — 1 and J]” =1 x , = k. See Figure 1.12 for an illustration. 

5. (a) Let p=pip 2 -p n be a permutation, and define 

S P = 11 * -j\ - \Pi -Pill- 

l<i<j<n 

Prove that the smallest possible positive value of dp is 2w-4. 

(b) Which graph theoretical problem contains part (a) as a special 
case? 

6. Let G be a graph. A k-coloring of G is the number of ways to color the 
vertices of G using only the colors 1, 2, ■■■, k so that adjacent vertices 
have different colors. Let P(n) be the number of w-colorings of G. It is 
then well-known that P(n) is a polynomial function of n, called the chromatic 
polynomial of G. 

Now let 

F g (: x) = '52 p (n)x\ 

n> 0 

It is proved in [144] that F3 ( x ) = p—^yipr, where Q[x) is a polynomial of 
degree m, and with nonnegative integer coefficients. 

So we can set Q(x ) = YHi=k w i xl i where k is the smallest number for 
which G has a ^-coloring, called the chromatic number of G. 
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(a) Find a combinatorial interpretation for the numbers w p in terms of 
permutations. 

(b) Explain why the polynomial Qfx) is a generalization of the Eulerian 
polynomials. 

7. Let n, i, and j be fixed positive integers, and set 

S(i,j,n) = kfn-ky. 

0<k<n 

Prove that 

S(i,0,n) = Y (r+i) r!S ( i ’ r )- 

8. Let us say that a permutation p contains a tight ascending run of length k 
if it has k consecutive entries pp M ---P i+iri so that p, +J =pi+j- 1 for 0<j<k-l. 
In other words, the sequence pp,+cp,+k-\ is a sequence of consecutive 
integers. 

Find a formula for the number of permutatations of length r+k containing 
a tight ascending run of length at least k, if k>r. 

9. The Bessel number B(n, k) is defined as the number of partitions of [n] into k 
nonempty blocks of size at most two. Prove that for any fixed n, the sequence 
B(n, 1), B(n, 2), •••, B(n, n) is unimodal. 

10. (a) Prove that if n= 2 m -l for some positive integer m, then all Eulerian 
numbers A(n, k) with 1 <k<n are odd. 

(b) Generalize the statement of part (a). 


Solutions to Problems Plus 

1. This result is due to V.Gasharov [103], who used the same lattice path 
model in his solution as he used to injectively prove that the Eulerian 
polynomials have log-concave coefficients. 

2. (a) This result is due to I.Gessel [109]. Let d(F) be the number of descents 

of a forest Land let 1(F) be the number of leaves of F. Then let u„(a, ft) 
be the bivariate generating function 

u n (aJ) = Y« d{F) P l{F) - 1 , 

F 


2004 by Chapman & Hall/CRC 



In One Line And Close. Permutations as Linear Orders. 


39 


where the sum is over all rooted forests on [n]. Then Gessel shows 
that the trivariate generating function 



is symmetric in a and [5 by proving that it satisfies the functional 
equation 


1+ U=(l+aU) (l+pU)e*- 


(b) If the number of leaves is one, then the tree consists of one line, and 
the sequence of the vertices corresponds to an n-permutation. The 
notion of descents of the tree then simplifies to that of descent in this 
permutation. 

3. This result is due to L.Takacs [186], though note that his paper denoted 
the Eulerian number A(n, k) by A(n, k- 1 ). The main idea of the proof is the 
following. Let 



k=r 


where P(n, k) is the probability that at the end of the trials there are k empty 
boxes. Then it can be proved that 



by showing that both sides satisfy the same recurrence relations. Then, by 
the formula P(k,n) = l) r-fc (^)B r (n), our claim follows. 

4. The volume of R mk is equal to A(n, k)/k\. A nice combinatorial proof was 
given by R.Stanley [181], though the result was probably known by 
Laplace. The main element of Stanley’s proof is the following 
measurepreserving map. It is straightforward that A(n,k)/k\ is the volume 
of the set S n , k of all points (aq, x 2 , ■ • •, x n ) 6 [0,1]" for which x lA <x t for 
exactly k values of i. (This includes i=0, where we set x 0 =0.) Let J[x h x 2 , 
■, x„)=(y h y 2 ,-,y«) where 



Note that /is not defined on the set of points where x ti =x t for some i, but 
that is not a problem as the set of those points has volume zero. Apart 
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from that, however, f maps the rest of S n:/l into R(n, k) as 
k — 1 < f2i=i = k — x n < k. Stanley then shows that apart from a 
subset of volume zero of R(n, k), the map/"has an inverse, and that/tis an 
affine transformation of determinant (-1)”, implying that f is order¬ 
preserving. 

5. (a) This result is due to W.Aitken [1], He called 8 P the total relative 

displacement of p. 

(b) Let G be a graph with n vertices, and let d(x,y) be the graph- 
theoretical distance (number of edges in the shortest path) between 
x and y. Then, for a permutation p of the vertices of G, one can 
define 

S G, P = X \ d (P*>Pv) - d (®tSOI- 

Then part (a) corresponds to the special case when G is the path 
12 —n. Also note that 8 p of part (a) is equal to 0 if and only if 
p=Y23---n or p=n---32l, which is also a special case of the general 
fact that 8(;p=0 if and only if p is an automorphism of G. 

6. This result is due to I.Tomescu [189]. In that paper, various formulae are 
proved for the numbers w h . 

(a) Let I be an acyclic orientation of G, and let G have m vertices. The 
transitive closure I’ is then a partial ordering of [mj. Let fhe a bijedive 
coloring of the vertices of /that is compatible with I’. In other words, 
if x<y, thcn /(xj</' f(y). Finally, let T(I) be the set of all total orders that 
extend I’. In other words, the I'(I) are all the possible choices for the 
bijective coloring /. 

Now for any f €. T (/), note that/tin fact defines a permutation of [m]. 
Let U(I) be the set of all these permutations. Finally, let M(G) be the 
multiset obtained by taking the union of all T(I), for all acyclic 
orientations /of G, preserving the multiplicities. 

It is then proved in [189] that for any graph G, the coefficient w k is the 
number of permutations in M(G) that have k ascents. 

(b) If G is the empty graph on m vertices, then M(G) contains all ml 
permutations of length m, and then Qfx)=A m (x)/x. 

7. There are several papers that are devoted to exploring connections between 
powers of integers and Stirling numbers of the second kind, or Eulerian 
numbers. See, for instance, [117] for this result. 

8. It is proved in [126] that this number is r\{ri+r+ 1). 
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9. It is easy to prove that 


B(n) 


n! 

2 n ~ k (n — k)\(2k — n)! ’ 


and then the result follows by checking that the sequence B(n+l)IB(n) is 
decreasing, therefore if it dips below 1, it has stay below 1. This result was 
published in [59]. 

10. (a) It is well-known (see, for example, [27], Exercise 14 of Chapter 4) that 
if t is a power of 2, then ( fc ) is even, except when k=t or k=l. In our 
case, this means that ("^ 1 ) is always even, except in those special cases, 
and the claim follows from Theorem 1.11. 

(b) Similarly, if n=p n -1 where p is a prime, then all Eulerian numbers 
A (n, k) are congruent to 1 modulo p. 

A combinatorial proof of these facts not using Theorem 1.11 can be 
found in [187]. 
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2.1 Inversions 

2.1.1 The generating function of permutations by inversions 

InSection 1.3, we looked at descents of permutations. That is, we studied instances 
in which an entry in a permutation was larger than the entry directly following it. A 
more comprehensive permutation statistic is that of inversions. This statistic will 
look for instances in which an entry of a permutation is smaller than some entry 
following it (not necessarily directly). 

DEFINITION 2.1 Lei p=j)\pr--p n be a permutation. We say that (pi,p) is an inversion of 
p f ifj but pP>P r 

Example 2.2 

Permutation 31524 has four inversions, namely (3, 1), (3, 2), (5, 2), and (5,4). 

□ 

This line of research started as early as 1901 [151]. In this section, we survey 
some of the most interesting results in this area. The number of inversions of p 
will be denoted by i(p), though some authors prefer inv(p). It is clear that 
0 < iip) < (") for all w-permutations, and that the two extreme values are 
attained by permutations 12--w and n(n- 1)—1, respectively. It is relatively easy to 
find the generating function enumerating all permutations of length n with respect 
to their number of inversions. 

THEOREM 2.3 

For all positive integers n^2, we have 

y x^ pS> = I n {x) = (1 + z)(l + x + x 2 ) ■ ■ • (1 + x + x 2 H-b x n -f. 

p€S n 

PROOF We prove the statement by induction on n. In fact, we prove that each 

43 
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of the n! expansion terms of the product I„(x) corresponds to exactly one 
permutation in S„. Moreover, the expansion term x° l x a ' 2 • • • x“ n ~ 1 will correspond 
to the unique permutation in which the entry z+1 precedes exactly i entries that 
are smaller than itself. 

If n=2. then there are two permutations to count, /z=12 has no inversions, and 
p = 21 has one inversion. So X%<<P ’ = 1 + x as claimed. Furthermore, 

p= 12 is represented by the expansion term 1, and p—2i is represented by the 
expansion term x. 

Now assume that we know the statement for n- 1, and prove it for n. Let p 
be a permutation of length n- 1. Insert the entry n into p to get the new 
permutation q. If we insert n into the last position, we create no new 
inversions. If we insert n into the next-to-last position of^, we create one new 
inversion as n will be larger than the last element of q. In general, if we insert 
n into p so that it precedes exactly i entries of p, we create i new inversions as 
n will form an inversion with each entry on its right, and with no entry on its 
left. Therefore, depending on where we inserted n, the new permutation q has 
0 or 1 or 2, etc., or n- 1 more inversions than p did. If/; was represented by the 
expansion term x ai x° 2 ■ ■ ■ x (ln 2 , and n is inserted so that it precedes i entries, 
then q is represented by the new expansion term x ai x a ’ 2 ■ ■ •x an ~ 2 x l - This 
argument works for all p, proving that 

/„ (x)=(l+x-\ -Hx”- 1 )/^! (x)=(l+x) (1 +X+X 2 ) ■■ ■ (1+xH- hx"' 1 ). 

■ 

Therefore, the number b(n, k) of w-permutations with k inversions is the 
coefficient of x !: in /„ (x). The fact that the polynomial I„ (x) can be decomposed 
into a product of factors enables us to prove the following result on these 
numbers. 

COROLLARY 2.4 

For anyfixed n, the sequence b(n, 0), b{n, 1), —, b(n, Q) is log-concave. 

PROOF Let us call a polynomial log-concave if its coefficients form a log- 
concave sequence. It is then not hard to prove (see Exercise 22 of Chapter 1) 
that the product of log-concave polynomials is log-concave. The previous 
theorem shows that the generating function of our sequence is the product of 

several log-concave polynomials (of the form l+x+xN-b xh), therefore our 

sequence itself is log-concave. 

The first few values of the numbers b(n, k) are shown in Figure 2.1. 

We would like to point out that it is not true that I fix) has real roots only. 
Indeed, if ri> 3, then I fix) is a multiple of 1+x+x 2 , and therefore has some 
complex roots. 
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n=l 

n=2 


1 1 

12 2 1 


1 


n=3 


n=4 


1 3 5 6 5 3 1 


n=5 


1 4 9 15 20 22 20 15 9 4 1 


FIGURE 2.1 

The values of b(n, k) for n<5. Row n starts with b(n, 0). 

We have found, with not much effort, the generating function of the numbers 
b(n, k) for fixed n. An enumeradve combinatorialist will certainly ask next whether 
it is possible to find a recursive, or even better, an explicit formula for these 
numbers, just as we did for the numbers A(n, k) in Section 1.1. As we will see, the 
latter is a somewhat more difficult task. 

As a warmup, we prove a recursive formula. 


LEMMA 2.5 


Let ri>k. Then we have 


b(n+ 1, k)=b{n+ 1, k-\)+b(n, k). 


( 2 . 1 ) 


PROOF Let p=pipr"pn+i be an (w+1)-permutation with k inversions, where 
k<n. If p n+ i=n- 1-1, then we can omit n +1 from the end of p and get an n- 
permutation with k inversions. If p=n+l for i<n, then let us interchange n +1 
and the entry immediately following it. This results in an (n+1) -permutation 
with k-1 inversions in which the entry n +1 is not in the first position. 
However, all (w+1)-permutations with k-1 inversions have that property (that 
n+1 is not in the first position) as putting n +1 to the first position would result 
in at least n>ti>k -1 inversions. This completes the proof, and also shows why 
the condition ri>k is needed. I 

Even without going into details, it is obvious that (2.1) does not hold in general, 
that is, when K>n. For instance, if k > (”) /2, then b(n+ 1, A)< b(n+ 1, k-1), which 
makes it impossible for (2.1) to hold as b(n, k)>Q. See Exercise 28 for a recursive 
formula for the case when lt>n. 

Finding an explicit formula for the numbers b(n, k) is significandy more difficult, 
even if we assume riAk. A little examination of the polynomial shows that 
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In (x) = EEo b ( n > k ) xk Sh ° WS tHat 


b(n, 1) = n — 1 


b(n, 0) = 1 


n 


0 



n > 1, 



n > 2 


n > 3 


n > 4. 


In order to see how these results are obtained, and to obtain a general formula, 
we need some notions that most readers are probably familiar with. 

DEFINITION 2.6 Let n be a positive integer. Ifai+apr...+a k =n, and the a, are all 
positive integers, tlwn we say that the k-tu pie («i, a%, —, a t ) is a composition of n into k 
parts. If the a, are all nonnegative integers, then we say that the k-tuple (<%, a 2 , —, a t ) is a 
weak composition of n into k parts. 

In the unlikely event that the reader has not met compositions before, the 
reader should take a moment to prove that the number of compositions of n 
into k parts is whereas the number of weak compositions of n into k 

parts is 

For instance, to get b(n, 2) as the coefficient of x 2 in I n (x), one has to count the 
weak compositions of 2 into n- 1 nonnegative parts, the first of which is at most 1. 
Indeed, one needs to find the coefficient of y? in the generating function I n (x)=( 1+x) 
(l+x+x^-jl+xH—bx”" 1 ). The number of all weak compositions of 2 into n- 1 
parts is ( 2+ ^Z] ~ ] j = ( n " i) = n , one of which consists of a first part equal to 2. 
This proves that bin, 2)=n-\. See Exercises 3 and 4 for proofs in the cases of !<=?> 
and i=4. 

This line of formulae suggests that maybe the formula for b(n, k) will be 
obtained by taking the difference of two suitably chosen binomial coefficients. 
However, this conjecture is false as we have 



Further conjectures claiming b(n, k) to be an alternating sum of binomial 
coefficients also turn out to be false. The truth is a bit more complicated than 
that. 
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FIGURE 2.2 

The Ferrers Shape of p=(5, 2, 1). 


Our main tool in finding the correct formula comes, remarkably, from the 
theory of partitions. Many readers are probably familiar with the following 
definition. 

DEFINITION 2. 7 Let . .>«„> 1 be integers so that aprap - Ya m =n. Then the 

array a^(a h a 2 , —, af) is called a partition of the integer n, and t!w numbers ai are called the 
parts of the partition a. The number of all partitions of n is denoted by p(n). 

Example 2.8 

The integer 5 has seven partitions, namely (5), (4, 1), (3, 2), (3,1, 1), (2, 2, 1), (2, 
1, 1, 1) and (1, 1, 1, 1, 1). Therefore, p(5)=7. [] 

The topic of integer partitions has been extensively researched for several 
centuries, from combinatorial, number theoretical, and analytic aspects. See [6] 
for a survey. 

We will use the following, simple, but extremely useful, representation of 
partitions by diagrams. A Ferrers shape of a partition p=(a,i,av, —, a t ) is a set of n 
square boxes with sides parallel to the coordinate axes so that in the zth row we 
have a, boxes and all rows start at the same vertical line. The Ferrers shape of the 
partition p=( 5, 2, 1) is shown in Figure 2.2. Clearly, there is an obvious bijection 
between partitions of n and Ferrers shapes of size n. 

We will need some basic facts about the generating functions of various 
partitions. 

PROPOSITION 2.9 

The ordinary generatingfundion of the numbers p(n) is 

OO -j 

E^ n= n^ (2.2) 

n>0 i —1 

PROOF We can decompose the right-hand side as 


(l + X+X 2 +-")(l + X 2 +X 4 +-")"-(l-|-X‘+X 2 '+•■■). 
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It is now clear that the coefficient of xf 1 in this product is equal to the number 
of vectors (q, c 2 , ••■) with nonnegative integer coefficients for which 
—i ic» = n - Note that such a vector can have only a finite number of 
nonzero coordinates. Finally, there is a natural bijection between these 
vectors and the partitions of n. This bijection maps (q, c 2 , •■■) into the 
partition that has c, parts equal to i. So the coefficient of x" on the right-hand 
side is p(n). | 

COROLLARY 2.10 

Let Pfn) be the number fpartitions of n into distinct parts. Then we have 

OO 

J2Pd(n)x n = Y[(l + P). ( 2 .3) 

n>0 i =1 


COROLLARY 2.11 

Letp(n, m) be the number of partitions of n into at most mparts. Then we have 

m ^ 

= (2.4) 

n>0 i=l 

A. pentagonal number is a nonnegative integer n satisfying n = |(3j 2 ± j) for 
some nonnegative integer j. So the first few pentagonal numbers are 0, 1, 2, 

5, 7, 

The following partition identity is the most interesting one in our quest for 
the formula for the numbers b(n, k). It is not nearly as easy as the preceding two 
corollaries. 

LEMMA 2.12 

Let po(n, d) (resp. pfn, d)) be t!w number of partitions of n into distinct odd parts (resp. 
distinct even parts). Then we have 


{ 0 if n is not pentagonal, 

(-1)7 if n = |(3 j 2 ±j). 

PROOF We will construct an almost bijection between the set of partitions of 
n into distinct even parts and the set of partitions of n into distinct odd parts. 
The bijection will work for all non-pentagonal integers n. If n is pentagonal, 
however, then the bijection will only almost work, that is, there will be one 
partition for which it will not. This “almost” property of the bijection is 
precisely the reason for which the lemma is a little bit more difficult to prove 
than the previous corollaries. After all, we should not find a bijection 
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FIGURE 2.3 

We have b(p)= 3, g(p)= 1, and b(q)= 1, g4)=2. 


here that works for all partitions as that would “prove” a false statement. Finding 
the single exceptional partition is not trivial. 

Let qz, (jk) be any partition of n. We define two parameters for q. 

The simpler one, g(q) (for gray) is just the size of the smallest part of q. In other 
words, g(q)=qh The other one, b(q) (for black) is the length of the longest 
strictly decreasing subsequence (qi, <y 2 , ■•■) of parts of q in which each part is 
exactly one less than the part immediately preceding it. So in particular, if 
q^q x -\, then b(q)= 1 . 

See Figure 2.3 for an illustration. 

The set of black boxes will be called the outer rim of a Ferrers shape. Now 
let p be a partition of n into an even number of distinct parts. We distinguish 
two cases. 

1. If g(q)<b(q), then we remove the last part of q, and add one to each of the 

first b(q) parts of q, to get the partition /(?). g(q)<.(q), then this operation 

decreases the number of parts by one. On the other hand, this operation 
always keeps all parts distinct. 

2. If g(q)~>b(q), then we remove one from each of the first b(q) parts of q, then 
affix a last part of size b(q) to the end of q to get the partition /(<?)■ This 
operation increases the number of parts by one, and if b(q)Kg(q)- 1 , it keeps 
the parts all distinct. 

So the only times when <p does not define a map from the set of partitions 
enumerated by p t (n, d) into the set of partitions enumerated by p 0 (n,d) are as 
follows. 

(A) When g(q)=b(q), then the last part of q and the outer rim of its Ferrers 

shape both consist of j boxes. Therefore, the partition q is of the form 
q=(yLj-\, 2/2, •■■, 7 ), so n = The reader should verify this with a 

small example, such as the partition q=( 3, 2 ). 

(B) When b(q)=g(q)-l, then the outer rim of q consists of j boxes, and its last 
part is 7 +I. Therefore, we have q={2j] 2/1, —,/+l), so n — ( 3 J+ 1 )^ . The 
reader should verify this with the small example q={ 4, 3). 
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50 Combinatorics of Permutations 

Note that in both of these exceptional cases, the integer j had to be even, to 
assure that q had an even number of parts. 

So in all cases but the exceptional cases (A) and (B), our function rip maps into 
the set of partitions of n into an odd number of distinct parts. It is clear that cp is 
one-to-one. Let us examine whether cp is suijective. 

Let r be a partition of n into an odd number of distinct parts, and let us try to 
find the preimage of r under cp. If g(r)<ib(r), then this preimage can be found by 
removing the last part of r and adding one to each of the first b(r) parts of r. If 
g(rf>b(r)-\-1 , then this preimage can be found by decreasing each of the first b(r) 
parts of r by one, and creating a new, last part of r that is of size b(r). So we can 
find the unique preimage <p _1 (r) of r unless 

(A’) b(r)=g(r), so r=(2fl, 2/2, ■••,/, and therefore, n = f 3 -?" 1 )! , or 

(B’) b(r)=g(r)- 1, so r=(2j, 2/1, ■■•,/l-l), and therefore, n = 

Also note that in cases (A) and (B’), the number / has to be odd to ensure that r 
has an odd number of parts. 

In other words, if n is not a pentagonal number, then <p is a bijection from 
the set of partitions enumerated by p e (n, d) onto the set of partitions enumerated 
by p 0 (n, d). 

In exceptional cases (A) and (B), (which occur when n = for some 

even positive integer j), there is one partition in the domain of cp that does not 
get mapped into a partition consisting of an odd number of parts, showing that 
pfn, d)-p a (n, /)=l=(-l) / as j is even. 

In exceptional cases (A’) and (B’), (which occur when n = for some 

odd positive integer j), there is one partition of n into a distinct number of 
odd parts that does not have a preimage under </, proving that pfn, d)-p 0 (n, 
d)= -1=(-1)/as j is odd. 


Now let p(n, m, d) be the number of partitions of n into m distinct parts. As a 
consequence of the previous lemma, note that if n is not of the form (3/+ j)I2 or 
[3f-j)/2 for some nonnegative integer/ then we have 

n 

^2(-l) m p(n,rn,d) = 0. 
m —1 


Otherwise, we have 

n 

1 ) m p(n,m,d) = (-l)L 

m =1 

The following Corollary links the pentagonal numbers to the enumeration of 
permutations according to their number of inversions. 
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COROLLARY 2.13 

[Euler’sformula.] We have 

f(x) — (1 — a;)(l — £ 2 )(1 — a; 3 ) ■■■ = 1 — x — x 2 + x 5 + x 7 — x 12 — ■ • - 

OO 

= £ (—l y x lv*+J)/*. 

j>-oo 


PROOF The left-hand side is similar to the generating function of the 
numbers pi(n) as given in (2.3), except for the negative sign within each 
term. This implies that the coefficient of x” on the left-hand side is not simply 
the sum of all the numbers pfn, m), but their signed sum 'Lfilfpfn, m). We 
know from Lemma 2.12 that this sum is 0, except when n is of the form 
(3/+/)/2 or (3f-j)/2, in which case this sum is equal to (-1)/. This completes 
the proof. 

We mention that the rather unusual summation fijy oo is used to include 
pentagonal numbers of the form (3f+j)/2 and (3f-j)/2 in the same sum. One 
can think of the sum 2 as the sum in which j ranges 

through all integers in order 0, -1, 1, -2, 2, -3, 3, ■■■. For j G denote 
#=(3/+y)/2. 

Recall that by Theorem 2.3, the polynomial L n (x) can be rearranged as 
follows. 

In(x) ~ H (1 + x + - ■ - X 1 - 1 ) = ; — ; 7 

1=1 Z =1 

While I n (x) is a polynomial and {i~ x ) n is an infinite product, their factors of 
degree at most k agree, therefore their coefficients for terms of degree at most k 
also agree. So our task is reduced to finding the coefficient of xh in 

f{x) ■ (1 - *)-" = fix) -W n + k ~ A 

ft >0 ^ ' 

where we set = 1 , In order to get a term with coefficient k in the product 
f(x)\ 1 -x)'”, we have to multiply the term (— l)-*a?® +l)/2 ~ (—if x dj of fix) by 
the term of (1-x)'” that has exponent kfi, that is, in which h=k-dj. Therefore, the 
coefficient of xh in Ijx) is, for riAk, 

«->'*) = £<-!>■ i n + t: d ffi) (2.5) 

where j is such that the pentagonal number dj is at most as large as k. 

The first few pentagonal numbers are shown in Figure 2.4. 
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FIGURE 2.4 

The first five pentagonal numbers. 


Expanding (2.5), we see 
b(n, k ) = ^ 

+ (” 


that the formula 

+ n-e 

:vm b 


for b(n, k) 

+ k- 2 \ 
jfc-1 ) 
+ k — 8 \ 
k-7 ) 


starts as follows. 
bn + k — 3\ 

V k-2 ) 


2.1.2 Major index 

Just as the number of descents, the number of inversions of a permutation is also 
obtained by some quite unrelated-looking statistics. The most famous of them is 
the major index, that was named after the rank of its inventor, Percy MacMahon, 
in the British Army. 

DEFINITION 2.14 Let p=p\pv -p n be a permutation, and define the major index 
or greater index maj(p) of p to be the sum of the descents of p. That is, 

maj(p)=ffieD( p ) l - 

Example 2.15 

Ifp= 352461, then D(p)={ 2, 5}, therefore maj(p)=7. Q 

In 1916, MacMahon showed [147] the following surprising theorem by proving 
that the two relevant generating functions were identical. It was not until 1968 
that a bijective proof was found by D.Foata [87], who worked in a more general 
setup. We present his proof in the simplified language of permutations. 

THEOREM 2.16 

For all positive integers n and all nonnegative integers k, there are as many n-permutations 
with k inversions as there are n-permutations with major index k. 

In other words, the permutation statistics “number of inversions,” that we denoted 
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by i (but is often denoted by inv ), and “major index,” often denoted by maj, are 
equidistributed on S n . If a permutation statistic s has the same distribution on S„ as 
i, then s is called Mahonian. 

PROOF For any permutation p=pip2 ■■■pm we call the entry p, large if p,i>p n , and 
we call p, small if p,<p n . 

We are going to prove our statement by recursively defining a bijection 
0 : S n —» S n so that for all p £ S n , the equality maj(p ) = l(0(p)) holds. Our 
map will have the additional feature of keeping the last element of p fixed. 

It will not surprise the reader that we define 0(1) = 1 for the initial case of 
n= 1 , and </>( 12 ) = 12 and 0(21) = 21 for the case of tf= 2 . 

Now assume that we have defined 4> for all (n- 1 )-permutations. In order to 
define <& for all n-permutations, we distinguish two cases. Let p=p\p2-. .p n be any 
w-permutation. 

1. First we consider the case when p n .\ is a small entry. In this case, take 
wp^fapz.. .p n Q=q x q 2 ...q xl _ x . Let qi t , <?j 2 , • • ■, % be the small entries of p 
in Wp, that is, those that are less than p n . Set i 0 = 0. Let 
Qj = (hj-i+i • • • qij. In other words, the Qj provide the unique 
decomposition of w into subwords that contain exaedy one small entry, 
and contain that small entry in the last position. For instance, if 
^i-"^fi=425613, then there are two small entries, 1 and 2 , and therefore, 
Qi= 42, and 2 2 = 561. Now define 

{ Qj if Qj is of length at most 1, 

x m xiX 2 ■ ■ ■ x m -\ if Qj = X 1 X 2 ■ ■ ■ x m , with m > 2. 


Finally, define 


and 


f(w P )=AQi)AQz)-f(Qk)> 


0(P) = f{Wp)Pn 


Example 2.17 

Let ?f= 5, andj?7=54213. Then we have w v = 0(5421) = 5421, and Q l =542, 
02 = 1 - Therefore, /(ayj=2541, and so 0(p) = 25413. Q 

2. When y;, fl is a large entry, the procedure is very similar. The only difference 
is in the definition of the strings Q r In this case, the Q, provide the unique 
decomposition of w into subwords that contain exactly one large entry, and 
contain that large entry in the last position. 
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Example 2.18 

Let n= 5, and let p= 13452. Then we have w p = 0(1345) = 1345, 
and Q i=13, Q 2 =4, and Q 3 =5. Therefore, f(w p )=3 145, and so 
<t>{p) = 31452. 0 

It is easy to see that <f> : S n —> >S' T , is a bijection. Indeed, verifying both cases, 
one sees that the first rule was used to create 4>{p ) if and only if the last element 
of (j){p ) is larger than the first element of <j>(p ). Once we know which rule was 
used to create ip(p ), we can recover w p from fi(w p ). Indeed, if the first (resp. second) 
rule was used, then xhef(Q) are the subwords that contain only one small (resp. 
large) entry, and contain that small (resp. large) entry in the first (resp. last) position. 
As/is a bijection, recovering \\\c f(QJ this way allows us to recover the Q„ and 
therefore, w p itself. Finally, / ; S n -\ —> S n - 1 is a bijection by induction, so we 
recover pip 2 -p *i l aorsi f(p x p, ! p tlX )=tv p 

We still need to prove that / : S n S n has the desired property, that is, it 
maps a permutation with major index k into a permutation with k inversions. 
We accomplish this by considering the two above cases separately. 

1. When p n _i is a small entry, then 


maj(p) = rnaj(pip 2 ■ ■ -p n - 1 ) = i{fi{piP 2 * * -p n - 1 )) = i{w p ). (2.6) 

How does the map/"change the number of inversions of w(pfi It does 
not change the order among the small entries, or among the large entries. 
If a small entry belongs to the subword Q } of length Z>1, then it jumps 
forward and passes all t -1 large entries of Q„ decreasing the number of 
inversions by t- 1. 

As each large entry will be passed by one small entry, the total decrease in 
inversions is equal to the number of large entries, that is, to np>„. However, 
affixing p n to the end of f(w p ) will create precisely np> n new inversions. 
Therefore, 

i(4>(p)) = i(f(w p )pn) = i(w p ), 

which, compared to (2.6), shows that maj{p) = i(p(p)) as claimed. 

2. When is a large entry, then 

maj{p) = rnaj(j>ip 2 ■ • -p n -i) + (n - 1) (2-7) 

= i(0(PiP2 ■ ■ -Pn-i)) + n- 1 = i( w p ) + n- 1. (2.8) 

Whcn / is applied to w p , each large entry jumps belonging to a subword 
of length t> 1 jumps forward, passes all t- 1 small entries of its subword, 
and increases the number of inversions by t- 1 . Each small entry is passed 
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by one large entry, so the total increase in the number of inversions is 
equal to the number of small entries, that is, p n - 1. On the other hand, 
affixing p n to the end oijfwp) will create precisely n-p„ new inversions. 
Therefore, 

i(<P{p)) = i(f(w p )p n = i(w p ) + ( p n - 1 ) + (n - p n ) = i(w p ) + n - 1 , 
which, compared to (2.7), shows that again, maj(p ) = as claimed. 

■ 

Other Examples of Mahonian statistics can be found among the exercises. 


2.1.3 An Application: Determinants and Graphs 
2.1.3.1 The Explicit Definition of Determinants 


There are several undergraduate mathematics courses and textbooks that only 
give a recursive definition of the determinant of a square matrix. That is, det 

Q is defined to be equal to ad-hc, and then the determinant of the nxn matrix 
A=(a,j) is defined to be 


detA = ^jP(—iy 1 aijAij 

j=i 


(2.9) 


where Ay is the (wT)x(n-l) matrix obtained from A by removing the first row 
and the /ill column. 

If that is the only definition of determinants the reader has seen, he may find 
the following result interesting. 

THEOREM 2.19 

Let A=(a,j) be an nxn matrix. Then we have 

det A = (~1) ^Qipi Q 2 P2 • • • a np n ■ (2 10) 

p€S„ 

That is, det A is obtained by taking all nl possible w-tuples of entries so that there 
is exactly one of the n entries in each row and each column, multiplying the 
elements of each such w-tuple together, finally taking a signed sum of these nl 
products, where the sign is determined by the parity of i(p), and p is the 
permutation determined by each chosen w-tuple. 

In other words, the w-tuples correspond to all possible placements of n rooks 
on an nxn chessboard so that no two of them hit each other. 
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Example 2.20 

Let n= 3. Then there are three 3-permutations with an even number of 
inversions, namely 123, 312, and 231, and there are three 3-permutations 
with an odd number of inversions, namely 132, 213, and 321. Therefore, we 
have 


□ 

PROOF (of Theorem 2.19). We prove the statement by induction on n, the 
initial cases of re=l and n =2 being obvious. Assume the statement is true for (n- 
l)x(w-l) matrices. That means that 

det Aij — 'y ' ( — 1) ' &2q2 a 3q3 a nq n ? ^ XI) 

9 

where q=q 2 qyqn is a. partial permutation, that is, a list of the integers 1, 2, 

;+l, win some order. 

Therefore, ay det Ay will contribute the products of all w-tuples starting with 
ay to det A. In other words, ay det Ay will correspond to all nonhitting rook 
placements in which there is a rook in position j of the first row. This argument 
can be applied for each j. So our theorem will be proved if we can show that the 
signs of these products are what they should be. 

Substituting the expression provided for Ay by formula (2.11) into formula 
(2.9), we see that the sign of the w-tuple that belongs to q becomes (-1 y H @. 
And indeed, the permutation p=jq 2 qyqn has precisely j -1 more inversions 
than the partial permutation q=q 2 qy-q„ as j is larger than j-1 other elements. 
This shows that the contribution of this n-tuple is indeed counted with sign 

(-IK 1 . I 

2.1.3.2 Perfect matchings in bipartite graphs 

The explicit definition of the determinant has some surprising applications in 
graph theory. A perfect matching in G is a set of vertex disjoint edges covering all 
vertices. The graph G is called bipartite if the vertex set of G can be cut into two 
parts X and Y so that all edges of G have one vertex in X and one vertex in Y. 
The truncated adjacency matrix of a simple bipartite graph G is the matrix B(G)=(b,j) 
in which b =1 if there is an edge between i g X and j G Y, and b= 0 otherwise. 
In other words, the rows of B represent the vertices of X, and the columns of B 
represent the vertices of Y 

Whether a bipartite graph has a perfect matching is an interesting and well- 
studied question. A sufficient and necessary condition for this existence problem 
is the well-known Marriage Theorem, which is included in most elementary 
graph theory books, such as [27]. 
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The concept of adjacency matrices provides us with a sufficient condition that 
is very easy to verify. 

THEOREM 2.21 

Let G be a bipartite graph with |Aj=|T|=?z that does not have a perfect matching. Then det 
B(G )=0 

PROOF We prove that det B(G)= 0 by showing that all n\ summands in the 
explicit definition (2.10) of B(G) are equal to 0. This is because the existence 
of a nonzero term bi Pl b 2 P2 • • • b npn would be equivalent to the existence of a 
perfect matching, namely the perfect matching in which * E X is matched to 
Pi 6 Y. | 

We also note that the number of all perfect matchings of G can be obtained by 
computing the permanent of B(G) that is defined by 

per B(G) = ^ b lpi b 2p2 * • • b nPn - 
pes„ 

That is ,perB(G) is defined just like det B(G), except that each term is added with 
a positive sign. 


2.2 Inversions in Permutations of Multisets 

Instead of permuting the elements of our favorite set, [n], in this Section we are 
going to permute elements of multisets. We will use the notation {l ai , 2° 2 , ■ • •, k ak } 

for the multiset consisting of a, copies of i, for all i E [fc]. 

For our purposes, a permutation of a multiset is just a way of listing all its 
elements. It is straightforward to see, and is proved in most undergraduate 
textbooks on enumerative combinatorics, that the number of all permutations of 
the multiset K = {l ai , 2 a2 , ■ • •, k ak } is 

n! 

ai!a 2 ! *■•»*! ’ 


where n=aOra$\ — \-a k . 

An inversion of a permutation p=pip 2 —p n of a multiset is defined just as it 
was for permutations of sets, that is, (i, j) is a inversion if i<j, but pfpf 

Example 2.22 

The multiset-permutation 1322 has two inversions, (2, 3), and (2, 4). Q 
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If we want to generalize Theorem 2.3 for permutations of multisets, that is, 
we want to count permutations of multisets according to their inversions, we 
encounter exciting and surprising connections between the objects at hand, and 
a plethora of remote-looking areas of combinatorics. 

Our goal is to find a closed expression for the sum 


E 

pes K 


( 2 . 12 ) 


where S K denotes the set of all permutations of the multiset K. We cannot 
reasonably expect something quite as simple as the result of Theorem 2.3 as the 
formula to be found will certainly depend on each of the a b and not just their 
sum n. Therefore, the reader will hopefully understand that we need some new 
notions before we can find the desired closed formula for ( 2 . 12 ). 

Let [n]=l +q+<f-\ - 'ref 1 , the polynomial whose importance we know from 

Theorem 2.3, and let [n]!=[1]!-[2]!.[n]!. Do not confuse [n]={ 1, 2, w}, 

which is a set, and [n]=l +q+<f-\ -h^ 1 , which is a polynomial. Note that if we 

substitute q= 1 , then [i]=z, and therefore [n] !=?/!, so this concepts generalizes the 
concept of factorials. The crucial definition of this Section is the following. 


DEFINITION 2.23 Let k and n be positive integers so that k<n. Then the (n, k)-Gaussian 
coefficient or q-binomial coefficient is denoted by [£], and is given by 



W![(n-k)]f 


Note that obviously, we have [k] = n-k . Also note that substituting q= 1 
reduces this definition to that of the usual binomial coefficients. This, and 
other connections between binomial and ^-binomial coefficients will be 
further explored shortly. Finally, we can define ^-multinomial coefficients 
accordingly. 


DEFINITION 2.24 Let a h (* 2 , •••, a k be positive integers so thatf2i=i a i —n. Then the 
(«!, fl 2 , ■■■, ( 2 4 )-Gaussian coefficient, or ^-multinomial coefficient is denoted by 
j 11 L and is given by 


n 

a l j a 2 j • • • i a k 


[ a i]![a 2 ]!---[a k ]!' 


We point out that similarly to multinomial coefficients, the ^-multinomial 
coefficients satisfy the identify 


n 



n 


n — ai 

n — ai — a 2 


a k 

.ai, a 2 , * * 

5 a k. 


. a i. 


. a 2 . 

a 3 


a k 
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The Gaussian coefficients look like rational functions of q, but, as we will 
soon see, it is not difficult to prove that they are in fact polynomials in q. Even more 
strongly, they are polynomials with positive integer coefficients. This is why 
sometimes they are called Gaussian polynomials. 

2.2.0.3 An Application: Gaussian Polynomials And Subset Sums 

Before we start applying Gaussian polynomials to obtain generating functions of 
multiset permutations, it seems beneficial to take a look at one of their several 
natural occurrences. The advantage of this will be that the reader will see in 
what sense the Gaussian coefficients [£] are generalizations of the binomial 
coefficients (£). That, in turn, will be helpful in putting into context the recursive 
formulae of Gaussian coefficients that we are going to use. 

THEOREM 2.25 

Let n and k befixed non-negative integers so that k<n. Let a, denote thenumber ofk-element 
subsets of [n] whose elements have sum i — that is, i larger than the minimum. Then 

we have 


ir 

k. 


k(n—k) 

a i ( l i - 


i-0 


(2.14) 


In other words, [ £ ] is the ordinary generatingfunction of tlw k-ekment subsets of [n] according 
to the sum of their elements. 


Example 2.26 

Let w=4 and 1—2. Then, among the six 2-element subsets of [4], two, namely {1, 
4} and (2, 3}, have sum 5, and all other sums from 3 to 7 are attained by exacdy 
one subset. Therefore, the right-hand side of (2.14) becomes \+q+2cf+(f+(f, 
which is indeed equal to 


□ 


4 

2 


- l)(g 4 - g) 

(q 2 - 1 ){q 2 - q) 


(q 2 + l)(q 2 +q+l). 


PROOF (of Theorem 2.25) We prove the statement by induction on n, the 
initial case of n=\ being obvious. Assume that the statement is true for n- 1 and 
prove it for n. Exercise 21 shows that 


n 

k 


= q 


n—k 


n- 1 
k- 1 


+ 


n- 1 
k 


(2.15) 


Therefore, our induction step will be complete if we can show that the poly¬ 
nomials ^ a iq i sat i s fy t ^ ie same recursive relation. That is, let b, be the 
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number of A-element subsets of [n- 1 ] whose sum of elements is i — snd let 
c,be the number of (M)-element subsets of [n-1 ] whose sum of elements is i — j; 
we then need to show that 

k(n—k) / k(n—k — 1) \ / (k— l)(n— k) 

E a,cf = ^ bif + q n ~ k • 5^ cd 

i=0 \ i=0 J y i=0 

This is the same as showing that Uj = b, + Cj,q n ~ k for all i, where undefined 
coefficients are to be treated as zero. However, the last equadon is clearly true 
as a Tsubset of [n] either does not contain n, and then it is accounted for by bj, 
or it does, and then it is accounted for by c,-(„-k), because of the shift in the 
definition of c,. | 



2.2.1 Inversions and Gaussian Coefficients 

Now we are ready to announce and prove the result describing the generating 
function of multiset-permutations according to the number of their inversions. 


THEOREM 2.27 

Let K = { l ai , 2® 2 , ■ • •, k ah } be a multiset so that , = i Oj = n, and let S K denote the 
set of all permutations ofK. Then we have 


53 q i(p) = 

p€S K 


n 

a l j a 2 j ' ' ' > &k: 


(2.16) 


PROOF First we prove the statement in the special case of k=l. In this case, K’ 
is a multiset consisting of «i copies of 1 and cui copies of 2 , so that ai+ai=n, and an 
inversion is an occurrence of a 2 on the left of a 1. We need to prove that in this 
special case, we have 


53 q i(p) = 

P e s ' K 


n 

a i 


(2.17) 


We prove this statement by induction on n. For n= 1 , the statement is trivially 
true as [J] = [J] = 1 . Now assume the statement is true for n- 1 , and prove 
it for n. A multiset permutation of K’ either ends in a 2, and then its last 
entry is not involved in any inversion, or it ends in a 1 , and then its last 
entry is involved in exactly a 2 =n-ai inversions. By the induction hypothesis, 
this means that 


E 

P es' K 


Av) 


n — 1 

a i 


+ q n ~ ai 


n- 1 

ai - 1 
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By (2.15), it is now easy to see that the right-hand side is in fact equivalent to 
j, completing the induction proof of (2.17). 

We are now in a position to prove our Theorem in its general form. We 
will do this by induction on k, the case of k=\ being trivial, and the case of 
k=2 being solved above. Assume that the statement of the theorem is true 
for K = {l ai , 2 a2 , ■ • -, k ak }, and prove that then it is also true for 
K + = {l 0l ,2 a2 ,---,fc afc ,(fc + l) 0i+1 }. 

Note that any permutation of K + is completely determined by the pair (p p 
where p’is the multiset-permutation obtained from p by replacing all entries less 
than h+i by 1, and p ” is the permutation obtained from p by removing all copies 
of k+ 1. It is then clear that 


i(p) = i(p') + i(p"), 


and that p’ and p” are independent of each other. 

Then the problem of finding ^ , qd'fi ) is clearly equivalent to the previous 

special case, and therefore we get that J ~2 p , q l ^ p - 1 = ak “ 3 . 

Now let us find YL p t: ^ If we remove all the copies of k+1, we can apply 

the induction hypothesis, and see that 


E 


J(p") 


n - a k+ i 


n — a k — ai 


a k 

a i 


a 2 


_ a k. 


Finally, as any p’ (consisting of apt-a^A - Ya k copies of 1, and <% +1 copies of 

k+\) can be paired with any p” (consisting of a,- copies of i for i g [ k ]), it 
follows that 


E 9 i(p) = E^ (p,) -E^ (p " ) 

P€S' k p' p" 


n 


n - a k+ i 

_ a k-t-l 


ai 

. 


n 



, a lj a 2 

5 

•, a k+i, 



n - a k - a x 


a k 

a 2 


, a k. 


Here the last equation is immediate from (2.13). This completes our induction 
proof. 

2.2.2 Major Index and Permutations of Multisets 

Recall that for permutations of the set [n], we found in Theorem 2.16 that the 
statistics z'and majwere equidistributed. We would like to see whether something 
similar is true for permutations of multisets. In order to be able to do that, we 
need to define the major index of multiset permutations. As a first step to that 
end, we need to define descents of multiset permutations. 
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Fortunately, both of these definitions are what one expects them to be. If 
p=p\pvpn is a permutation of a multiset, then we say that i is a descent of p if 
pppi+ 1 - Similarly, the major index of the multiset permutation p is defined by 
maj(p) = E ie D( P ) *• 

Now we are ready to state the ^-generalization of Theorem 2.16. 

THEOREM 2.28 

Let K = {l 01 ,2“ 2 , • • •, k ak } be a multiset so that ai+ap - Ya k =n. Then the statistics i 

and maj are equidistributed on t!w set S(K) of all permutations ofKIn other words, 

Y qi(p) = Y q rna dP) = 

pes(K) pes(K) 


n 

a l j a 2 > ' ‘ > a k 


PROOF As we have proved Theorem 2.16 by a bijection, this time we present 
an inductive proof. Just as in the proof of Theorem 2.27, we first treat the special 
case when K consists of a k copies of 1 and ai copies of 2 only, with a k +ao=n. 
Denote by K 2 this special multiset. In this special case, the statement is obviously 
true for n=\. Now let us assume that we know the statement for all positive 
integers less than n, and prove it for n. 

Now let us partition our permutations of K 2 according to the position of the last 
2. If the position of the last 2 is i, then we can be sure that there are no descents 
ofp in positions zTl, •••, n- 1. There is a descent at z, unless i=n. Finally, the string 
on the left of pi consists of a 2 -l copies of 2, and ay-(n-i) copies of 1. This implies 
that, by our induction hypothesis, we have 


Y g ma hP) 


= q 


i- 1 
a2 — 1 


where p ranges the permutations of K 2 in which the last 2 is in position 
fl 2 <z<?z-l, and 


Y q majip ) = 

p 


n- 1 


n- 1 

a 2 - 1 


a i 


where p ranges the permutations of K 2 which end in 2. Therefore, all we have to 
show to complete the induction step is that 


n - 1 


n —1 

i =02 


i- 1 

a 2 - 1 


n 

a i 


(2.18) 


or, equivalently, 


n 


n- 1 

a i. 


a i 


n—1 

q a2 <?* 02 

i=a 2 


i- 1 
a 2 — 1 
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and this last statement is true as it is clearly equivalent to the recursive formula 
proved in Exercise 23. This completes our induction proof for the special case 
when K=K 2 . 

Finally, to prove the theorem for general K, we can proceed by induction on k, 
very much like in the proof of Theorem 2.27. The details are similar to that 
proof, and are left to the reader. 

As we have mentioned, there are many interesting occurrences of Gaussian 
coefficients in combinatorics. Perhaps the most direct one is the following. 

THEOREM 2.29 

Let q be a power of a prime number, and let V be an n-dimensional Vector space over the 
q-elementfield. Then the number of k-dimensional subspaces ofV is [£]. 

PROOF First, let us choose a A-tuple of vectors in V that form an (ordered) 
basis for a /('-dimensional subspace. For this, we have to choose k linearly 
independent vectors from our vector space V. For the first basis vector Vi, we 
can choose any vector in V except 0, so we have q n -1 choices. For the second 
basis vector, we cannot choose any multiples of Vi, therefore we have only q"- 
q choices. For the third vector, we cannot choose any of the q 1 possible linear 
combinations of Ui and v 2 , yielding q"-f choices, and so on. Iterating this 
argument, we see that we have 

{q n - l)(q n - q) ■ ■ ■ (q n - q*- 1 ) (2.19) 

choices for an ordered basis of a ^-dimensional subspace of V. It goes without 
saying that any such subspace has many ordered bases. In fact, repeating the 
above argument with k playing the role of n shows that the number of ordered 
bases of a ^-dimensional subspace is 

So this is how many times each ^-dimensional subset of V is counted by (2.19). 
Therefore, the number of such subspaces is 

(q n - l){q n - q) ■ ■ ■ (q n - if- 1 ) _ (, q n - 1 )(q n ~ 1 -!)•■• ( q n ~ k+1 - 1 ) 
(q k - 1 ){q k - q) ■ ■ ■ {q k - q k ~ l ) (q k - 1 )(q k ~ 1 - 1 ) •••(«- 1 ) 

[n][n - 1] • • • [n - k + 1] rnt 

M “hf 

■ 

We will see some alternative interpretations of the Gaussian coefficients in the 
exercises. 
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Exercises 

1. +Let us generalize the notion of Eulerian polynomials as follows. Let 

B n (x) = (-l) i(p} x 1+d(p) . 

p€S n 

That is, the only difference between this definition and that of A„(x) is 
that here the parity of the number of inversions is taken into account. 
Prove that 

B 2n (x)=(l-x) n A n (x), 

and 

B in+ fx)=(l-x) n A n+ fx). 

2. Following the line of thinking found in Exercise 17 of Chapter 1, define the 
r-major index of p, denoted by rmaj(p) as 

rmaj(p) = ^ * ) + |{(*,i) : 1 < i < j < n, Pi > Pj > Pi ~ r}|, 

\ieRD(p) J 

where RD(p) denotes the set of all r-descents oip, as defined in the mentioned 
Exercise. 

(a) Explain why the r-major index is a generalization of both the number 
of inversions and the major index. 

(b) Prove that for any positive integers r, the r-major index is a Mahonian 
statistic. 

3. Prove (without using the general formula for b(n, k)) that b(n, 3)= 

4. Prove (without using the general formula for b(n, k)) that b(n, 4)= 

(T) - (T)^ 4 - 

5. Let us call a 2w-permutation p=pp>r"p 2 n 2-ordered if ■••<p 2 „-1 and 

p 2 < <ps < C-" < C p 2 n- Prove that 

]Ti(p)=n4"-\ 

p 

where the sum is taken over all 2-ordered 2n-permutations p. 

6 . Let m> 1 be a positive integer, and let j be a nonnegative integer, with 
/<m. Prove that if n is large enough, then the number of w-permutations p 
for which i(p)=j mod(m) is independent of j. 
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7. Let L be a rooted tree with root 0 and non-root vertex set [n]. Define 
an inversion of T”to be a pair (ij) of vertices so that i>j, and the unique 
path from 0 to j goes through i. How many such trees have zero 
inversions? 

8. Let p E S n have n -2 descents. What is the minimal and maximal possible 
value of i(p)2 

9. It follows from Lemma 2.12 that 

Y p^ n ~ a W) = Y p( n ~ a C?'))> 

j even j odd 

where a(j)=(3f+]l2. Find a direct bijective proof of this identity. 

10. Let p = p\p 2 “'p n be a permutation, and let our goal be to eliminate all 
four-tuples of entries (p a , p h , p c , pp) in which «<7><£<d and p a <p><tp b <p) d . 
In order to achieve that goal, we use the following algorithm. We 
choose a four-tuple F with the above property at random, and 
interchange its two middle entries. By doing that, we took away the 
undesirable property of F, but we may have created new four-tuples 
with that property. Then pick another four-tuple with that property, 
and repeat the procedure. 

Prove that no matter what p is, and how we choose our four-tuples, this 
algorithm will always stop, that is, it will eliminate all four-tuples with the 
undesirable property. 

11. Is it true that b(n, k) is a polynomially recursive function of n for any 
fixed k ? 

12. Let p(n, k) be the number of partitions of n into k parts. Let P(x)= 
YP™- 1 p{n,k)x k . Does there exist an integer n >2 so that P(x) has real 

zeros only? 

13. Let B n be the set of all w-tuples (b u b 2 , ■■■, b n ) of nonnegative integers 
that satisfy bi<i -1 for all i. How many elements of B n satisfy 
£”=i bi = k? 

14. Let B n be defined as in Exercise 13, and let B(n, k) be the number of n- 
tuples in B n that have exactly k different entries. Find a formula for 
B(n, k). 

15. Express b(n, k) using summands of the type b(n-l, i). 

16. Compute the value of ££ =0 (— l) k b(n, k). 

17. Let p 6 S n have n -1 alternating runs, and assume that n= 2k+l. What is 
the minimal and maximal possible value of i(p)2 
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18. (a) +Let A = {l ffll , 2" 2 }, and assume that a, and a 2 are relative primes 

to each other, with a 1 +a 2 =w. Let I(A, k) be the number of permutations 
p of A so that 

i(p)=k mod n. 

Prove that I (A, k) = ~ ( c ") for all k. 

(b) +What can we say about I(A, k) if a± and a^ have largest common 
divisor d>l ? 

19. +The Denert statistic, denoted by den, is defined on S n as follows. Let 
p G S n , then den(p) is the number of pairs (l, k) of integ ers satisfying 
1 <Kk<n, and one of the conditions listed below 

Pk <Pi < k, 

pi <k < pk- 

k <Pk <Pi- 


So for instance, den{ 132) =2 as the pair (2, 3) satisfies the first condition, 
and the pair (1, 2) satisfies the second condition. Prove that the Denert 
statistic is Mahonian. 


20. We know that (£) is the number of northeastern lattice paths from (0, 0) 
to (k, n-k). Extend this correspondence to one that provides an 
interpretation for 


21. Prove by way of computation that 


n' 

k. 


n- 1 

k 


+ q 


XI—k 


n- 1 

k- 1 


22. Prove the identity of the previous exercise by a combinatorial argument. 

23. Prove that 


n=r- k - 

m- 1 

+ q m ~ k ~ 1 ■ 

m — 2 

+ ••• + £/• 

k+1 

+ 

k 

L k J 

k - i 


k - 1 


k 


k 


24. Prove by way of computation that is always a polynomial with non¬ 
negative integers as coefficients. 

25. Prove that 

= q n P{fk,n ), 

n>0 


i + k 
k 


where p(i, k, n) is the number of partitions of the integer n into at most i 
parts of size at most k each. 
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27. Let m<n, and let 

A m (n)={ 1, 1, 2, 2, m, m, m+ 1, m+ 2, n}. 

Let a m (n) be the number of all permutations of the multiset A m (n) in which 
12--W occurs as a subword. (The letters of this subword do not have to be 
consecutive entries of the permutation.) Prove that 

a m +i(n) — (n + 2 m)a rrl {n) - m(n + m)a m -i(n). 

28. Let n < k < (”)■ Prove that 

b(n+ 1, k)=b(n+ 1, k-\)+b b(n, k)-b(n, k-n-1). 

29. +Find a formula for 


n 



30. +Consider the following refinement of the Eulerian polynomials. Let 


An Ad) = X> maJ ' (P) , 


P 


where the sum is taken over all ?/-pcrmutations having k-1 descents. 
These polynomials are often called the ^-Eulerian polynomials. Prove 
that 



Problems Plus 

1. Let a(k, l) be the number of w-permutations having k descents and 
major index l. Let d(k,l) be the number of w-permutations q having k 
excedances and satisfying den(q)=l. Prove that a(k, l)=d(k, l). This 
fact can be referred to by saying that the den-exc statistic is Euler- 
Mahonian. 

2. Find a permutation statistic s: S „—>N so that the number c(k, l) of n- 
permutations p for which s(p)=k and i(p)=l is equal to a(k, l) of the 
previous problem. In other words, find a statistic s so that the joint 
statistic s-i is Euler-Mahonian. 
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3. Prove that the joint statistic (dmc, maj) is Euler-Mahonian. 

4. Let 1 <k<n. Prove, using the polynomial I n (x), that the number of n- 
permutations p for which 

maj(p)=j (mod k) 

does not depend on j. 

5. Define the (q, ^-Eulerian polynomials by 

A[n,k,r] = Y, q rmaj(p) - 

p€S n 


Prove that 


A[n, k, r]=[r+k] A[n- 1, k, ?\+q k + r - 1 [n+l-k-r\ A[n- 1, k-\, r], 
where [m]=\+q+if H- 

6. A parking function is a function^: [n]-^[n] so that for all * G n there are 
at least i elements j £ [n] for which f()=i- Prove that the number of 
parking functions on [n ]satisfying l f(J) = ( 2 ) — ^ ec [ ua l t0 the 
number of rooted trees with root 0 and non-root vertex set [n] that 
have k inversions. (See Exercise 7 for a definition of inversion in a 
tree.) 

7. Prove that the Gaussian polynomial [”] has unimodal coefficients. 


8. Let A ~ {l ai , 2 “ 2 and let d be the largest common divisor of a, and a 2 . 
Now let J(A, k) be the number of permutations pot A that have first entry 
1, and for which 

i(p)=k mod a 2 

holds. Let 0 < t < ^ — 1. Prove that 


J(A, k) 


d b n — 1\ 
ai \di ~ 1 / 


Note the difference from Exercise 18. Here we are looking at residue classes 
modulo ci\, not modulo n. 

9. Log-concavity is a concept for sequences of numbers, but it can be extended 
to a concept for sequences of polynomials as follows. 

Let pa(q), pi(q), ■■■, p m (q) be a sequence of polynomials with nonnegative 
coefficients. We say that this sequence is q-log-cmcave if the poly-nomial 
p‘ 1 id) ~ Pk-i(q)Pk+i (d) has non-negative coefficients for all k. 

Prove that for any fixed n, the sequence of polynomials [“], [“] 

is ^-log-concave. 
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10. Let us extend the notion of unimodality to polynomials as follows. Let 
po(q), pi(q), ■■■> pm(q) be a sequence of polynomials with nonnegative 
coefficients. We say that this sequence is q-unimodal if there exists an index 
j so that 0 <j<m and for all i, the polynomial pj(q)p>i(q) has nonnegative 
coefficients. Note that a ^-log-concave sequence does not have to be q- 
unimodal. Prove that any fixed n, the sequence of polynomials 
[ 0 ] > [r ] j ''' s [n]^ s ^tinimodal. 

11. Generalize the result of Exercise 27 to the multiset 

A m ,n,r = {l r+ \ 2 r+1 , • • •, m r+1 , (m + If, • • •, n r } 

as follows. Let a m r (n) be the number of permutations of A„ jV that contain a 
subwords consisting of r copies of 1, then r copies of 2, and so on, ending 
with r copies of n. Again, the letters of the subword do not have to be 
consecutive entries in the permutation. Prove that 

a m ^i r (n)=(m+2m-r+l) a m , r (n)-m(rn+m) a^/n), 
where ao, r (n)=l, and a ir (n)=m+\-r. 


Solutions to Problems Plus 

1. This result is due to D.Foata and D.Zeilberger [91], who proved it by 
providing alternative interpretations for the Denert statistic. In particular, 
they showed that 

dm(p)=iprip — \-i m +i(Exc p)+i(Nexc p), 

where z], i^, i m are the excedances of p, while Exc p is the substring 
PhPi 2 ''' Pim, and Nexcp is the substring obtained from/> by removing Exc 
p. In our example in the text, 132, we get that 

dCTz(132)=2+0+0=2, 


as we should. 

2. Such a statistic was given by M.Skandera in [172]. 

3. This result was obtained by D.Foata, [88], who first proved that the jointed 
statistics (d, i) and (d, maj) had the same distribution. 

4. For k=n, the statement means that there are (n-1)! permutations in S„ so 
that maj(p)=j(vcsodrt). This result was first proved in [18], using a heavy 
algebraic machinery. In that same paper, the authors provided a bijective 
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proof as well, but that still used Standard Toung tableaux and the Robinson- 
Schensted correspondece, which we will cover in Chapter 6. The general 
statement for k 6 [n] was given the following simple and beautiful proof in 
[19]. We know from Theorems 2.3 and 2.16 that 

y x ma 7(p) = I n (x) = (1 + ®)(1 + x + x 2 ) ■ ■ • (1 + x +-1- x n ~ L ). 

p€S n 

If we count our permutations according to the remainder of the major 
index modulo k, then we have to take the above equation modulo the 
polynomial x*-l. If x is any /ah root of unity other than 1, then the left-hand 
side vanishes as there is at least one factor on the right-hand side, 
q_ x -|_-p x k ~ 1 ) = xh Ii vanishes. Therefore, 

y X ma ^ = C (1 + X + ■ - - + x k ~ l ) mod(x k - 1) 
p£S n 

as the left-hand side and the right-hand side have k-\ common roots. Setting 
x=l, we get that c=n\lk, and that 

I n (k) = ^ ■ (1 +* + ••• + x k ~ l ) mod(x fc - 1). 
k 

That proves that the number of w-permutations p satisfying 
nul j(p) = j {mod li) 

is n\lk, proving our claim. 

5. This result is due to D.Rawlings [160], 

6. This result, in a slighdy different form, was found by G.Kreweras [140]. 

7. There are several proofs of this fact that had first been noticed by Cayley 
at the end of the nineteenth century. Some of these proofs are reasonably 
short, but use sophisticated machinery. See [167], or [159] for such 
proofs. An elementary proof was given by K.O’Hara [155], who used 
the subset sum interpretation of Gaussian coefficients in her proof. Her 
argument was later explained in an expository article by Zeilberger 
[205], 

8. This result was proved in [53]. The authors showed that p=pip 2 ---pn, 
then exactly d of the n cyclic translates pr"pf>u ■■■, p n pvp 2 have first 
entry 1 and inversion number k modulo a\. spCy 1 ].) = n("i)> this 
proves the result. Note that the result is identical to the result of 
Exercise 18 (b), even if in that exercise we counted different 
permutations. 

9. This result was proved in [54]. 
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10. This is a special case of a more general result of L.Butler [55], which is of 
group-theoretical flavor. In her proof, Butler uses the interesting fact that 
the number of subgroups of order (j‘ of the Abelian group Z™ is the Gaussian 
polynomial [”]. where q is a prime. 

11. It is proved in [105] that the exponential generating function of the sequence 
a mr (n) is 



from which the proof of our statement follows. In [203], L.Yen sketches a 
bijective proof. 
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In Many Circles. Permutations as 
Products of Cycles. 


3.1 Decomposing a permutation into cycles 

So far we have looked at permutations as linear orders, that is, ways of listing n 
objects so that each object gets listed exacdy once. In this Section we will discuss 
permutations from a different viewpoint. We will consider them as Junctions. Let 
us redefine permutations along these lines. 

DEFINIT’ION3.1 Letf: [n]—>[n] be a bijedion. Then we say that f is a permutation of 
the set fnj. 

This definition certainly does not contradict our former definition of permutations. 
Formerly, we said that 34152 was a permutation of length five. Now we can 
reformulate that sentence by saying that the function f: [5]—>[5] defined by/jl)=3, 
y(2)=4,y(3)=l,y(4)=5, andyj5)=2 is a permutation of [5]. Going backwards, the 
one-line notation simply involved writingyjl)/(2)"^/"(w) in one line. 

This new look at permutations makes another way of writing them plausible. 
We write 

1 2 3 4 5 
3 3 4 1 5 2’ 

expressing that f maps 1 to 3, 2 to 4, 3 to 1, 4 to 5, and 5 to 2. 

This notation is called the two-line notation of permutations. It is more 
cumbersome than the one-line notation, which consists of writing the second 
line only, but it has its own advantages as the reader will see shortly. 

Let / and gbe two permutations of [n]. Then we can define their product fg 
by (f-g) &=g(f&) for i e [nj. It is straight forward to verify that the set of all 

permutations of [n] forms a group when endowed with this operation. Therefore, 
the set of all permutations of [n] is often denoted by S n and is called the symmetric 
group of degree n. We note that for ?z>3, the group S n is not commutative, so in 
general, fgtgf. 

The symmetric group is a quintessential ingredient of group theory. It is well 
known, for instance, that every finite group of n elements is a subgroup of S„. 

73 
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FIGURE 3.1 

The cycles ofy=34152. 


Extensive research of the symmetric group is therefore certainly justified. In this 
Chapter, we will concentrate on the enumerative combinatorics of the symmetric 
group, that is, we are going to count permutations according to statistics that are 
relevant to this second way of looking at them. 

A closer look at our running example, the permutation7=34152 reveals that 
f permutes the elements 3 and 1 among themselves, and the elements 2, 4, 5 
among themselves. That is, no matter how many times we apply /, we will always 
have^(3)=l ory*‘(3)=3, and/’/S) will never have any other values. In other 
words, / cyclically permutes 1 and 3, and f cyclically permutes 2 and 4 and 5. 
This phenomenon is illustrated in Figure 3.1. 

This phenomenon is highlighted by a third way of writing the permutation f 
We write7=(13) (245), and call it a cycle notation of f. When reading a permutation 
in this notation, we map each element to the one on its right, except for the 
elements that are last within their parentheses. Those are mapped to the first 
element within their parentheses. 

Example 3.2 

The permutation g=(12) (356) (4) is the permutation 

_ 1 2 3 4 5 6 
9 ~ 2 1 5 4 6 3' 


n 

Several comments are in order. First, if the cycles are disjoint, the order among 
the cycles clearly does not matter, that is (12) (356) (4)=(356) (4) (12)= (4) (12) 
(356), and so on. Indeed, the image of each i 6 \n] only depends on its position 
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within its cycle, and some other elements in its cycle. Second, if a cycle has at least 
three elements, then the order of elements within that cycle matters, up to a 
certain point. Indeed, the cycles (356), (563), and (635) describe the same action 
of g on elements 3, 5, and 6, but the cycle (365) describes a different action. This 
action maps 3 to 6, not to 5 as the previous actions did. A little further 
consideration shows that each cyclical action on k elements can be described by 
writing k different cycles as one can start with any of the k elements. 

We would like to have a unique way of writing our permutations using the 
cycle notation. Therefore, we will write the largest element of each cycle first, 
then we will arrange the cycles in increasing order of their first elements. This 
way of writing permutations will be called their canonical <ycle notation. 

Example 3.3 

Permutation (312) (45) (8) (976) is in canonical cycle notation. 

3.1.1 An Application: Sign and Determinants 

The cycle decomposition of a permutation / contains some crucial information 
about f. For instance, if we know the cycle lengths of f we can compute the 
smallest positive integer m for which f m is the identity permutation. This number 
m is called the order of f in group theory. Indeed, m is obtained as the smallest 
common multiples of all cycle lengths of f. 

A common way of fitting permutations into algebraic frameworks is by defining 
permutation matrices, that is, square matrices whose entries are all equal to 0 or 1, 
and that contain exaedy one 1 in each row and each column. There are two 
equally useful ways to do this. Ee.t p=pip 2 —p n be an w-permutation, and let A p be 
the matrix nXn matrix in which 



It is then straightforward to check that the map f: .S„—>R” X ” defined hyf(p)=A p is 
a homomorphism, that is, Apq A p -A q . 

Example 3.4 

/f/=2413=(4312), then we have 


/ 00 1 0 \ 
10 0 0 
0 0 0 1 
\0 1 0 0 / 


□ 
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Now let Bp be the nxn matrix defined by 

B a j) = 1 1 if Pi = j > 

p J j 0 otherwise. 

The following example illustrates how closely the matrices A p and B p are 
connected. 


Example 3.5 

If jb=2413=(4312), then we have 

/ 0100 \ 

0 0 0 1 

p 1000 ' 

\ 0010 / 

□ 

Clearly, for any p, the matrix B p is the transpose of A p . Furthermore, it is 
straightforward to check that ApB p =BpA p =I, so the matrices A p and B p are inverses 
of each other. In other words, the inverse of a permutation matrix is its own 
transpose. 

Comparing the definitions of A p and B p , one could ask when is the first one 
easier to use, and when is the second one easier to use. The advantage of the 
first definition is that as we said, the map f: 5„—>R" X " defined by f(p)= A p is a 
homomorphism. This is not true for the map g: S„— >R” X ” defined by g(p)=B p . 
That map is an anti-homomorphism, that is, B p =B^-B p . This can be useful when we 
use our matrices to permute vectors of size n. 

For instance, let 

(xi\ 

X2 

X — 

£3 

\x 4 J 


If the permutation p=2413=(4312) acts on the coordinates of this vector, 

X4 


it takes the vector x into p(x)= 


Xi 


. We could have obtained this vector 


w 


by simply taking the product Bps.. If now another permutation q acts on the 
vector p(x), then we can obtain the image by computing Bq(Bpx)=B pq (x). If, instead 
of the column-vector x, we had worked with the row-vector y&ys), then 

we would have used the matrices A p and A q , to compute the images yA p and 

yAp^yMr 
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DEFINITION3.6 A permutation is called odd ( resp. even) if it has an odd {resp. even) 
number of inversions. 

The following Proposition shows that we have to be careful throwing around the 
words “odd” and “even” when describing permutations. Let us call a cycle even 
(resp. odd) if it consists of an even (resp. odd) number of elements. 


PROPOSITION 3.7 

A permutation that consists of exactly one even cycle is odd. A permutation that consists of 
exactly one odd cycle is even. 

PROOF We prove the claim by induction on the length n of the only cycle of 
our permutation^. For re=l and n=2. the statement is trivially true. Now let w>3, 
and consider the cycle {pf^'ff It is straightforward to verify that 
(p\p r i" if) = (p\p r i"'pit\) (p^\pn) ■ The multiplication by (p, tl p^) at the end simply swaps 
the last two entries of (pipvp, u), and therefore, either increases the number of 
inversions by one, or decreases it by one. So in either case, it changes the parity 
of the number of inversions. The proof is then immediate by the induction 
hypothesis. | 

The omnipresence of this notion is illustrated by the following Lemma. 

LEMMA 3.8 

Let p be a permutation. Then tlw following are equivalent. 

& P is even , 

(ii) det A p =det B p = 1, 

(iii) the number f even cycles of p is even. 

PROOF 

• (*) 4 $ (if. This follows from the definition of determinants, as given in 
Theorem 2.19 because the determinant of a permutation matrix B p has 
only one nonzero term, and that is (— 1 )*(p) b ip ({y The claim follows 
as each term after the product sign is equal to 1. 

• (if (my This is true as p is the product of its cycles. By Proposition 3.7, 
even cycles correspond to odd permutations, so the determinant of their 
permutation matrices is T. Therefore, there has to be an even number of 
them for the determinant of their product to be 1. 

■ 
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In particular, the product of two even permutations is even, and the inverse of 
an even permutation is even. As the identity permutation is even, this proves the 
following. 

PROPOSITION 3.9 

The set of all even permutations in S„forms a subgroup. 

This subgroup is called the alternating group of degree n, and is denoted by A n . 
The reader should prove at this point that A n has n\l 2 elements if n>2, then she 
should check her answer in Exercise 1. Like the symmetric group, the 
alternating group has been vigorously investigated throughout the last century. 
For instance, it is known that A n is a simple group if ii> 5, and among all finite 
simple groups, A x is the easiest to define. (See any introductory book on group 
theory for the definition of a simple group.) It is also interesting that A„ is by far 
larger than any other proper subgroup of S n . Indeed, other proper subgroups of 
S„ are of size at most (wT)!. Can you find such a subgroup? 


3.1.2 An Application: Geometric transformations 

Some crucial properties of geometric transformations are easy to read off if we 
write the corresponding permutation into the product of simpler 
permutations. For starters, consider a regular hexagon H. Apply various 
symmetries to this hexagon. It is a well-known fact in group theory, and is not 
difficult to prove, that there are 12 such symmetries, and that they can all be 
obtained by repeated applications of r, which is a reflection through a fixed 
axis a, and t, that is a rotation by 60 degrees counterclockwise. See Figure 3.2 
for an illustration. 

We point out that both transformations rand / correspond to permutations of 
the set [A, B, C, D, E, F), or, after relabeling, [6]. In particular, r= (43) (52) (61), 
and £=(123456). 

It is natural to ask the following question. Given a series of symmetries, such 
as s=trrtrtrrtr, how can we decide whether the composite transformation can be 
realized just by moving our hexagon in its original 2-dimensional plane. 

This question is easy to answer if we have the transformation given in the 
above form. Clearly, transformation r changes the orientation of the hexagon, 
and therefore cannot be realized by a 2-dimensional movement. On the other 
hand, t does not change the orientation of our hexagon. Consequently, if and 
only if our composite transformation contains an odd number of reflections r, 
then it changes the orientation of H, and therefore cannot be realized by a 2- 
dimensional movement. The example s of the previous paragraph contains six 
reflections, and is therefore realizable in the plane. 

Similar considerations are helpful in deciding whether a certain symmetry of 
a 3-dimensional solid, such as a cube, can be realized by moving the cube in the 
3-dimensional space. 
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FIGURE 3.2 

Two symmetries of a regular hexagon. 


3.2 Type and Stirling numbers 

We start with two well-known and basic enumeration problems. It is natural to 
ask how many w-pcrmutations have a given cycle structure -for instance, how 
many 12-permutations consist of a 4-cycle, two 3-cycles, one 2-cycle, and zero T 
cycles. It is also natural to ask the more inclusive question of how many n- 
permutations have exacdy k cycles. 

In order to facilitate the answer of the first question, we make the following 
definition. 


3.2.1 The type of a permutation 

DEFINITION 3.10 Let p be an n-permutation that has exactly a, cycles of length i, for all 
positive integers i g [nl. Then we say that p is of type («i, < 22 , —, «„). 
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Example 3.11 

The permutation p=(21) (534) (6) (987) is of type (1, 1, 2, 0, 0, 0, 0, 0, 0). [] 

The number of permutations with a given type is fairly easy to obtain. 

PROPOSITION 3.12 

Let («!, < 22 , «„) be an n-tuple of nonnegative integers so tlmt fffi a-i • i =n. Tlun the 

number ofn-permutations of type (a h a , a„) is 

nl 

ailaf. ■ ■ • a n !l ai 2° 2 • • •n“" ' 

PROOF Let us write the elements of [n] in a linear order, in one of n\ possible 
ways. Then let us place parentheses between the numbers so that the first a, 
entries form the a, cycles of length one, the next a, 2 entries form the a 2 cycles of 
length two, and so on. The permutations we obtain this way will all be of type 
(ai, fl 2 , ( 2 „), but they will not be all different. Indeed, the sets of entries forming 
cycles of the same length can be permuted among each other without changing 
the resulting permutation. Therefore, we obtain each permutation from ap. arj. 
■■■a„! linear orders, due to permuting cycles of the same length. Finally, each i- 
cycle can be obtained in i different ways as any of its i entries can be at the first 
position. So even if we keep the sets of entries in each cycle fixed, there are 
] 2... fi a n different linear orders that could lead to any given permutation of 

type (a h a 2 . ■■■, a „). This shows that on the whole, each permutation of the desired 
type is obtained from ai\af. • • • a n !l ai 2“ 2 • • ■ n 0 ’ 1 linear orders, and the statement 
is proved. 

For instance, there are (w-1)! permutations of length n that consist of one n- 
cycle, and there are permutations of length 2 n that consist of n cycles of 

length 2. 


3.2.2 An Application: Conjugate permutations 

In the symmetric group S n , two permutations g and h are called conjugates of 
each other if there exists an element / £ S n so that fgf=h holds. In group 
theory, it is often very useful to know that two elements are conjugates as they 
share many basic properties. Fortunately, the cycle decomposition of g and h 
reveals whether they are conjugates or not. This is the content of the next 
lemma. 

LEMMA 3.13 

Elements g and h of S n are conjugates in S„ fand only f they are of the same type. 
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PROOF Recall that if gand h are two w-permutations, then the action of gh on [n] 
is obtained by first applying g to [n], and then applying h to the output. 

First assume that g and h are conjugates, that is, Jgf 1= h for some 
/ € S„. Let (6162 ■ • • bfc) be a cycle of g. Then g'{b\ ) = 6 1 + i, for all indices i € 
[fc-1], andg k (bi) = 61 . Ash = fgf~ x , we have h' = fgf~ l fgf~ 1 --fgf~ 1 = 
fg l f~ l . Therefore, h'(x) = ( fg l f~ l )(x )• 

Now choose x so that f(x) — b\. Then (fg , f~ 1 )(x) = /~ 1 (<?'(bi)) = 
/ _1 (6 i+ i) if i 6 [k - 1], and ( fg k f~ 1 )(x) = Thus multiplying by/ 

from the left and f l by the right turns the cycle (b b ■■■b) into the cycle 

Therefore, the A-cycles o^gare^in bijecdon with 
the ^-cycles of h for all k, and the “only if” part of our lemma is proved. 

Now assume that g and h have the same type. We construct a permutation/ 
so that / gf~ l = h. If (6162 • • • 6 *) is a cycle of gand fq- is a cycle of h, then 
the argument of the previous paragraph shows that we must choose/so that/ 
1 (b i )=c i for f~ l (bi) = Ci for i S [jfc]. This defines/ 1 for k entries. To find/ 1 for 
the remaining n-k entries, proceed similarly for all the remaining cycles. 


Remark We point out that in the second paragraph of this proof we showed 
that conjugating by / turned the cycle {bib 2 ---b t ) into the cycle 
(/ _1 (61 )/ _I (by) ■ ■ ■ }~ l ( bk )) ■ This simple fact is often used in similar arguments. 
We will see one application in the next subsection. 

It is straightforward to prove that the relation “g and h are conjugates” is an 
equivalence relation. The equivalence classes created by this relation are called 
conjugacy classes. The following simple consequence of the previous lemma is of 
fundamental importance in representation theory. 

COROLLARY 3.14 

The number of conjugacy classes fS n is p(n). 


3.2.3 An Application: Trees and Transpositions 

We have seen that the decomposition of a permutation into disjoint cycles is 
unique, up to the transformations discussed above. We have also seen that the 
multiplication of disjoint cycles (viewed as permutations) is a commutative 
operation. However, if we drop the condition that the cycles be disjoint, 
everything falls apart. Indeed, we have (12) (13) =812 and (12) (13) =312? 
(13) (12)^231. as counterexamples. In fact, it is not hard to see that any 
permutation in S n can be obtained as a product of (not necessarily disjoint) 2- 
cycles, or, in other words, transpositions. See Exercise 11 for a stronger version 
of this fact. 

While the topic of generating permutations from given sets of not necessarily 
disjoint cycles belongs more to group theory than to combinatorics, the following 
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results of Denes [69] certainly have a combinatorial flavor. Let us call a 
permutation cyclic if it consists of one cycle only. 

LEMMA 3.15 

Let s 2 , J 3 , s„ be transpositions of S„. Then the product s^y-s 2 is equal to a cyclic 
permutation f and only if the graph G(s 2 , j 3 , ■■■, s n ) with vertices 1, 2, ■■■, n and edges s 2 , 
s 3 , ■■■, s n is a tree. 


In the unlikely event that the reader is not familiar with basic graph theory, the 
relevant definitions can be found in any introductory combinatorics book, such 
as [27], 

PROOF Assume that s n s„ A --s 2 is cyclic. Clearly, for any i, m e [n], there is a 
path between i and s m s mA ---s 2 (i) in our graph. As our graph G(s 2 , j 3 , •••, t„) has 
all its elements in one cycle, this means that G(s 2 , •••, ,s„) is connected. On 

the other hand, by definition, our graph has n-1 edges, therefore it has to be 
a tree. 

Now assume that T is a tree with vertex set [nj with edges s 2 , j 3 , •••, s n . Omit 
edge s„, then T splits into two smaller trees, say T and T'. By induction, each of 
these trees corresponds to a cycle, say Gi and C 2 . As these cycles are disjoint, we 
have 


\ s ^2- s 2=M- C i C ^ 

where Sy(aicf), with a\ £ C\ and a 2 £ CV With all these conditions, s n s n .ys 2 
must be a cyclic permutation. 

One of the best-known theorems of graph theory is Cayley’s formula, stating 
that the number of elements of the set T„ of all trees on vertex set [n] is re”" 2 . See 
[27] for some proofs. The previous Lemma enables us to link this result to cyclic 
permutations. 


THEOREM 3.16 

Let p be a given cyclic permutation oflength n. Then the number of ways to decompose p into 
the product ofn-1 transpositions is if- 1 . 

This result was first proved by Denes in [69]. The proof we are presenting is due 
to Moszkowski [150]. 

PROOF Clearly, the choice of p is irrelevant as one can always relabel the 
entries ol p, so we may assume that p=(122>--n). We are going to construct a 
bijection h: T n —»C n , where C n is the set of all (wT)-tuples ( s n , s nA , ■■■, s 2 ) of 
transpositions satisfying s n s n .iS^ 2 —s 2 = (12—ri). 
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Labeling the edges of 7! 


Let T e T n Then for every vertex i g {2,3, ■ • •, n}, there is exactly one 

path from 1 to i in I. If (a, i) is the last edge in this path, then we label the edge 
(a, i) by sj- See Figure 3.3 for an example. 

Let s ; denote the transposition interchanging the two endpoints of s t . Set 
Ct — s„ T s n _] T ■ ■ ■ S 2 T > then Cyis a cyclic permutadon because of the previous 
Lemma. As an intermediate step in constructing our bijection h, we define the 
permutation// by the formula 

f T (k)=C k T ~ 1 ( 1), 

for i<k<n. The reader is invited to verify that 

h-Cr-ff 1 =(123 ■•■«). (3.1) 

An explanation is given in the solution of Exercise 15. Moreover, we set 

uj = frs-Jff 1 (3.2) 

for 2 <i<n. By Lemma 3.13, we see that uf is also a transposition as it is the 
conjugate of transposition s i . Furthermore, 

u n ■ u l-l ■■■ u 2 = IT' S n ‘ ■ ff 1 ■ f T Sn-1 T ' ff' ‘ ' ' ' It ’ ^ ■ ff 1 

= fTS n T s n - 1 T ■■■s 2 r ff 1 = (12 ■■■n), 

where the last line holds because of (3.1). {uf, uf_ ,, • ■ •, uf) € C- Now we 
define h (T)=uf, u f_ u - ■ ,uf 

We claim that h: T„—>C„ is a bijection. To see this, we will show that h has an 
inverse; in other words, that every element of C n has exactly one preimage 
under h. 
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T 




FIGURE 3.4 

The connection between the labels of the two trees. 


(A) First we show that C E C n cannot have more than one preimage. Assume 
that (u n , u n - 1 , • • •, U 2 ) E C- Then u„-u„ rl ---U2=(12---n), and by the previous 
Lemma, the graph G(u n , u^, ■■■, u 2 ) with vertex set [n] and edges u n , u n . h ■■■, 
u 2 is a tree. 

(i) If T is a tree so that h(l)=u n -u n .i"-U 2 , then the transpositions s, are 
uniquely determined by the m, and formula (3.2). In fact, it is 
straightforward to check that the trees T and G(u n , 244 , ■■■, zz 2 ) are 
isomorphic as unlabeled trees because T is obtained from the tree 
G(u n , u n .i, ■■■, w 2 ) by applying the permutationfr to the vertex set [n]. Indeed, 
we get 

Si T = fp l Uif T , 

and as we pointed out in the Remark after the proof of Lemma 3.13 
the effect of conjugating a cyclic permutation (in this case, a 
transposition) by f~ l is precisely the same as applying j r to the 
underlying set. 

(ii) On the other hand, we havc //{l ) by definition, and we also know 
that for i> 1, the last edge of the path from 1 to z in T’is s T ■ Therefore, 
if in G(u n , M, tl , •••, m 2 ), the endpoint of the path from 1 to the edge w, is 
x, then we must haveTr^^- 

See Figure 3.4 for an illustration of these labels. 

We have seen in part (i) that T as an unlabeled tree is uniquely determined 
by ( u n , 244 , •••, m 2 ), and then we have seen in part (ii) that the labels of T are 
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also uniquely determined by (u„, u n . u —, u 2 ). So if a T satisfying h(T)=(u n , u m 
i, ■■■, u 2 ) exists, it must be unique. 

(B) Now we show that each (u n ,u n - 1 , • ■ •, uf) € C n has a preimage under h. 

Let T(u n , Un. i, ■■■, u 2 ) be the tree obtained from G(u n , u n i, ■■■, u 2 ) by applying 
the permutation^"defined by f(x)=i to the vertices, for 2 <i<n. (We know 
from part (A) that this is the only tree that has a chance to be the preimage 
of ( u„ u n 4 , •••, u 2 ). Here x,-is defined as in (ii) of part (A). We will show that 
h(T)={u m u n _ h u 2 ), so Tis the preimage we are looking for. By construction, 
the edges of T are thefuf For shortness, let us write s=fuf.Then we can 
compute Cy lor our candidate T as follows. 

as u n -u n _ 1 —uz=(12--ri). As the next step, we compute fr(k) for P. 
f T (k) = C k T - 1 (l) = f(k). 

Therefore, ff=f, so indeed, h(I)=(u„, u n . h ■■■, u 2 ) as claimed. ^ 


3.2.4 Permutations with a given number of cycles 

Let us return to the enumeration of permutations with conditions on the number 
of their cycles. 

DEFINITION 3.17 The number of n-permutations with k cycles is called the signless 
Stirling number of t!ufirst kind, and is denoted by c(n, k). 


Example 3.18 

For all positive integers ri> 1, we have c(n, n - 2) = 2 Q) + | (£) ( n p 2 )- 

n 

PROOF There are two ways an w-permutation can have n -2 cycles. It can 
have a 3-cycle and n -3 fixed points, or it can have two 2-cycles, and n -4 fixed 
points. | 

All we need to explain about the name “signless Stirling numbers” is the part 
“signless” and the part “Stirling numbers”. That is, what sign are we missing, 
and what is the connection between these numbers and the Stirling numbers of 
the second kind, defined in Chapter 1. We will answer these questions shortly. 
Let us start with the most basic property of the signless Stirling numbers of the 
first kind. 
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LEMMA 3.19 

Let us set c(n, 0)=0 if ri> 1, and c( 0, 0)=1. Then the numbers c(n, k) satisfy 
c(n, k)=c{n-1 , k-l)+(n-l)c(n-l , k). 

PROOF Take an w-permutation p with k cycles. Then the entry n of p either 
forms its on 1 -cycle, in which case we have dn-l, k-1) possibilities for the rest of 
the permutation, or is part of a larger cycle, in which case it can be mapped into 
one of n- 1 elements. In this case, we have c(n- 1, k) possibilities for the permutation 
that we obtain if we omit n from p, and the result follows. f 

The Stirling numbers c(n, 0), c(n, 1), •••, c(n, n) can be easily generated as the 
coefficients of a certain polynomial. This is the content of the next theorem. 

THEOREM 3.20 

For all positive integers n, we have 

n 

x(x + 1) ■ ■ fx + n - l) = Y c i n , k )x k ■ (3.3) 

k =o 

In other words, c(n, k) is the coefficient of yd in x(x+l)—(x+n-l). 

PROOF Let b(n, k) be the coefficient of xd in F n (x)=x{x+ l)--(x+ n- 1). It is then 
clear that b{n, 0)=0 for h> 0, and we set b(0, 0)=1. We claim that the numbers b(n, 
k) satisfy the same recurrence relation as the numbers c(n, k), that is, 

b(n, k)=b(n-l, AT) +(n-\) b(n-l, A). 

Indeed, F„(x)=(x+n-l)F„ 1 (x), so we have 

n n n— 1 

Y, b(n, k)x k = Yj b{n, k)x k+1 + (n — 1 ) ^ b(n — 1 , k)x k 
k=0 k=l k =0 

by definition. Taking the coefficient of xd on both sides of the last equation we get 
what was to be proved. | 

Note that Theorem 3.20 immediately implies the following property of the signless 
Stirling numbers of the first kind (a property which is often difficult to prove for 
other sequences). 
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COROLLARY 3.21 

For anyfixed positive integer n, t!w sequence { c(n, k) } 0 <i<o has real zeros only. In particular, this 
sequence is log-concave, and therefore, unimodal. 

We point out that a far-reaching generalization of this result has been proved by 
Francesco Brenti. See Problem Plus 10 for that result. 

By now, you probably know that our next question will be whether there 
is a combinatorial proof of the log-concavity of the sequence { c(n, k) } k . The 
answer is in the affirmative, as shown by the following construction due to 
B.Sagan [166]. In this proof, we modify our cycle notation a little bit by 
writing our cycles with their smallest entry first This will be called reverse cycle 
notation. 

In the construction of Sagan, and in solving some of the exercises of this 
chapter, the notion of gap positions turns out to be useful. A n-permutation 
written in cycle notation has n+ 1 gap positions, one after each entry p, (this gap 
position is in the same cycle as p ,), and one at the end of the permutation, in a 
separate cycle. For instance, if ^=(1) (23), then inserting 4 into the first, 
second, third and fourth gap position of p, we get the permutations (14) (23), 
(1) (243), (1) (234), and (1) (23) (4). Note that the reverse canonical form of p 
is preserved, no matter where we insert 4. This is why we chose this kind of 
cycle notation. 

Let P(n, k) denote the number set of w-permutations with k cycles. Define a 
map ®: Pin, k-\)xP(n, k+\)—>P(n, k)xP(n, k) as follows. 

Let p g Pin, m )• Let /;<,-> be the permutation obtained from p by removing all 
entries larger than i from the cycle decomposition of p. 

Example 3.22 

Let^=(124) ( 35 ),thenwehave^< 1 >=(l),^< 2 >=( 12 ),^< 3 > =( 12 ) (3), P< 4 >=(124) (3), 
and /><5>=(124) (35). Q 

Note that if p <,> has t cycles, then /)<,+,> has either t or t+ 1 cycles. Now let 
(p, q) e P(n,k - 1) x P(n,k + 1), and look at the sequences {/><,>} i<^„ and 
{<?<!>}i<&• Let j be the largest index for which p <£> has one cycle less than q<p,. Such 
an index must exist as the difference between the number of cycles of q<£> and 
p< 3 > changes by at most one at each step, starts at 0 (when f=l), and ends in 2 
(when i=n). 

Swap the elements of the pair (p<j>,q<fi), to get the pair (q<j>,p^>). Now insert j+ 1 
into the same gap position of q< t> as it would have to be inserted if we wanted to 
expand p^y into p<^i>. Similarly, insert j+ 1 into the same gap position of p<p, as it 
would have to be inserted if we wanted to expand <^> into yc /+ i>. Call the new pair 
of permutations obtained this way (5<j+i > : p'<j+ 1 >) ■ Then continue with 
analogous steps. That is, construct >, p' <j+a+l> ) from (q' <j+1> ,p' <j+1> ) 

by inserting 7 +a+l into the same gap position of p / <J+0> (resp. p / < j +a> ) that we 
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would use if we had to expand p <1+a> (resp. q< l+£ >) into p <J+a+ 1> (resp. q <t+a+1> ) . At 
the end of the procedure, we define $(p, q) = (q' <n> , p' <n> )- 

Example 3.23 

Let w=6 and kp=2, and let^>=(125463), and ^=(13) (24) (56). It is then clear that 
j= 4, and p<A>={ 1243) and ^ <4> =(13) (24). After swapping, we get the pair 

{q<s>t P<s> )=((13) (24),(1243)). 

To make further computations easier, we note that p <3> =( 12543) and <y <5> =( 13) 
(24) (5). 

The entry 5 would have to be inserted into the second gap position of/;< 4 > to 
get p K 5>, and into the fifth gap position of q <v> to get q K 5> . So we insert 5 into the 
second gap position of e/ <4> and into the fifth gap position of 4> . We obtain 

(9<5>>P<5>) = (( 135 )( 24 )>( 1243 )( 5 ))- 

Finally, the entry 6 would have to be inserted into the fourth gap position of/><-,> 
to get p<s>=p, an( J into the fifth gap position of q K 5 > to get q <e> =q. So we insert 6 
into the fourth gap position of f/ <5> and into the fifth gap position of p' <r)> . We 

end up with 

?) = (</< 8 >,P< 6 >) = ((135) (264), (1243) (56)). 


□ 

THEOREM 3.24 

The map O defined above is an injection that maps P(n, k-l)xP(n, k+1) into P(n, 
k)xP(n, k). 

PROOF First, O indeed maps into P(n, k)xP(n, k). This is because passing from 
p<c> to p <M> a new cycle is formed if and only if z+1 is inserted into the last gap 
position. By the definition of j, this happens one more time from to <y<„> than 
from ^ <;> to /;<„>. After swapping, this will precisely compensate for the extra 
cycle that we created when passing from q <v> to q^, in comparison to the segment 
from p <v> to p<p. 

To see that O is an injection, note that if (q A>, p ’< JL >) is in the image of ®, its 
only preimage is easy to reconstruct. Indeed, remove n from both permutations, 
then remove n- 1 from both permutations, and so on, and stop as soon as the first 
permutation has exactly one more cycles than the second. This provides the 
index j, and the preimage of (q’^fi ’<£>) is then obtained by reversing the algorithm 
used in constructing ®. 

See Figure 3.5 for the values of c(n, k) for 0<n<5. 
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n=0 i 

n=l 0 1 

n=2 Oil 

n=3 0 2 3 1 

n=4 0 6 11 6 1 

n=5 0 24 50 35 10 1 

FIGURE 3.5 

The values of c(n, k) for k<5. The NE-SW diagonals contain the values for fixed k. Row 
n starts with c(n, 0). 


As there seems to be no symmetry in any row of this diagram, it is natural to 
ask where the maximum of each row is. That is, for fixed n, what is the value (or 
values) of k for which c(n, k) is maximal. Recall that by Corollary 3.21 and 
Proposition 1.32, there are either one or two such values of k. (In the latter case, 
the two values of k must be consecutive.) We will call these values of k the peak of 
the sequence. 

Surprisingly, our main tool in answering this question comes from the real 
zeros property of the numbers c(n, k) as proved in Corollary 3.21. This enables 
us to use the following powerful, and not quite well-known, theorem of Darroch. 

THEOREM 3.25 

Let A(x) = o a kX k be a polynomial tluitlms real roots only that satisfies T(1)>0. Let 

mbe a modefor the sequence of the coefficients ofA(x). Let y=A’(l)/A(l). Then we have 


\y-m\ZA. 


See [66] for the proof of this spectacular result, as well as for further details. For 
certain values of n, an even more precise result is known. 

In other words, m differs from y by less than 1. So if we can compute y, we 
need to check at most two values of c(n, k) to find the peak of the sequence {c(n, 
k)}k. We say peak, not peaks, because Erdos [84] proved that this sequence has a 
unique peak if ri>3. 

Let A(x) = Ylk =o c ( n > k)x k - As A(l)=nl, all we need in order to apply 
Theorem 3.25, is to compute A\ 1). We get 

n 

A'(x) = & ' c ( n > k)x k ~ 1 , 

k=1 


A'(l) = k-c(n,k). 

k=1 


2004 by Chapman & Hall/CRC 


90 


Combinatorics ofPermutations 


In other words, A’(l) is nothing else but the total number of all cycles in all n- 
permutations. As A(l)=nl, we see that fc=A’(l)IA(l) is the average number of cycles of 
a randomly chosen n-permutation, foretelling the kind of objects we will study in 
Chapter 6. 


LEMMA 3.26 

The average number of cycles of a randomly chosen n-permutation is 


n 

H(n) = £ 7 - 


i= 1 


PROOF We prove the lemma by induction on n, the initial case of n= 1 being 
trivial. Assume the statement is true for n, and take a permutation^ on [n]. Now 
insert n+ 1 into any of the n+l gap positions of p. This will create a new cycle if 
and only if n+l was inserted into the last gap position, that is, in l/(w+l) of all 
cases. Thus the average number of cycles in a randomly selected (w+l)- 
permutation is 

.. , n+l . 

- - • H(n) + — -(H(n) + 1) = H(n) +-- = V - 

n + l n + l n + l C— J i 

t=l 


as claimed. 


■ 


Therefore, we have just shown that using Theorem 3.25, we can locate (up to 
distance 1) the mode of the sequence { c(n, k) \ for any fixed n. 

THEOREM 3.27 

Let n>3 be a fixed positive integer. Then the unique value ofk for which fin, k) is maximal is 
within distance 1 ofH(n). 

The time has come to explain the adjective “signless” in Definition 3.17. 

DEFINITION 3.28 The Stirling numbers of thefirst kind, are defined by s(n, k)=(-l) n ~ 
k c(n, k). 

The values of these numbers for n>5 are shown in Figure 3.6. 

COROLLARY 3.29 

For all positive integers n, we have 

n 

x(x — 1) ■ • * (* — n + 1) = ^ s(n, k)x k . 

k~ 0 
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n=0 




1 



n=l 



0 


1 


n=2 


0 


-1 


1 

n=3 

0 


2 


- 3 

1 

n=4 

o - 

6 


11 

- 

6 

n=5 

0 24 


-50 


35 

-10 


FIGURE 3.6 

The values of s(n, k) for ?z<5 The NE-SW diagonals contain the values for fixed k. Row n 
starts with s(n, 0). 


In other words, s(n, k) is the coefficient of x? in x(x-l)—(x-n+l)=(x)n. 

PROOF Substitute -x for x in formula (3.3), proved in Theorem 3.20. Then 
multiply both sides by (-1)”. | 

The motivation of assigning signs to Stirling numbers of the first kind is 
explained by the following theorem, which also shows the close connection 
between the numbers S(n, k) and s(n, k), justifying their similar names. 

THEOREM 3.30 

Let S be the infinite lower triangular matrix whose rows and columns are indexed by N, and 
whose entries are given by Sij=S(i,j). Let s be defined similarly, with Sij=s(i,j). Then we have 
Ss=sS=I. 


PROOF It is clear that A={ 1, x, a 2 , a 3 , •••} and B={ 1, x, (x ) 2 , (x) 3 , •••} are both 
bases of the vector space R [x] of all polynomials with real coefficients. Here 
(x) M =x(x-l)-(x-m+l). Corollary 3.29 shows that the numbers s(n, k) are the 
coordinates of the elements of B in basis A, while Exercise 21 shows that the 
numbers S(n, k) are the coordinates of the elements of A in basis B. In other 
words, s is the transition matrix from A to B, and S is the transition matrix from 
B to A Therefore, they must be inverses of each other. 


We mention that an explicit formula for the numbers s(n, k) has been known 
since 1852 [170]. It is significantly more complicated than the corresponding 
formula (1.16) for the Stirling numbers of the second kind. | 


s(n, k) 


£ (-i)‘ 

0 <h<n—k 


(n— 1 + h\ 
\n — k + h) 


f 2 n — k \ 
\n — k — hj 


S(n — k + h,h) 
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(h - j) n ~ k + h 


hi 


0 <i<h<n—k 


This formula is called Schlomilch’s formula, and we postpone its proof until 
the end of the Chapter. The reason for this is that in the proof of that formula, 
we will use the generating functions of the numbers s(n, k), which we are going to 
compute shordy. 

3.2.5 Generating functions for Stirling numbers 

Proving recursive formulae for Stirling numbers of the first kind combinatorially 
is not quite as easy as it is for Stirling numbers of the second kind. The additional 
degree of difficulty comes from the fact that the numbers s(n, k) are not always 
positive. Therefore, in many cases, an argument using generating functions turns 
out to be simpler. This is the subject of present section. 

Define the following double generating function for Stirling numbers of the 


first kind. 



PROPOSITION 3.31 

We have 


f(x, m)=(1+m)*. 


PROOF The coefficient of u n on the right-hand side is C) by the Binomial 
theorem. On the left-hand side, it is 



and the statement is proved. We have used Corollary 3.29 in the above equality. 


Sometimes an alternative form of f(x, u) is easier to use. 


PROPOSITION 3.32 

We have 



n>0 


PROOF Immediate from Corollary 3.29. | 
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The nice, compact form of the double generating function f(x, u) results in a 
plethora of recursive results on the numbers s(n, k). These recursive formulae 
are sometimes classified into three sets, triangular, vertical, and horizontal recurrences. 
These names are based on the arrays of the numbers s(n, k) involved. 

A triangular recurrence is readily obtained from Lemma 3.19. It is 

s(n, k)=s(n-\, £T)-(h-1) s(n -\, k), (3.4) 


where ii>\. 

Now we are going to use our generating functions to obtain a vertical 
recurrence. 


LEMMA 3.33 

For allfixed positive integers k and n, we have 

ks(n, k) — V (-I.)"-*"! . s (l,k- 1). 

l=k -1 

PROOF It is clear from Proposition 3.31 that 

d ^Q x U ^ = /(*,«) ln(l +*). 

The coefficient of P A u’in\ on the left-hand side is ks(n, k). The right-hand side 
equals 


/(*> u ) 5 Z (“ 1 ) I+1 j = S Y . 7f • H (- 1 )' +1 J- 


^- = EE*(mV? 

;>1 i >0 3=0 ' l> 1 

Therefore, the coefficient of P A u n !n\ on the right-hand side is 

n—1 




l=k-1 


proving our claim. 

Sometimes a univariate generating function is sufficient. The “horizontal” 
generating function of the numbers s(n, k) was given in Corollary 3.29. Their 
“vertical” generating function is computed below. 


LEMMA 3.34 

For anyfixed k, the exponential generatingfundion of the Stirling numbers f thefirst kind is 
given by 


ffiu) = 


n=k 


[ln(l + u)] k 
k\ 
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PROOF Changing the order of summation in f(x, u), we get 

n oo n n 

f(x,u ) = -^2fic( u ) xk - 

k =0 n=k k =0 

The proof now follows by equating the coefficients of x* in two forms of f(x, u), in 
the one above, and in 

f(x,u ) = (1 + u) x = exp [a: In (1 + u)] = ^ — x k . 

k =0 


From here, the analogous result for c(n, k) is a breeze. Note that in the next 
section we will see how to deduce this Corollary by a more direct and more 
combinatorial argument. 


COROLLARY 3.35 

For anyfixed k, the exponential generatingfunction of the signless Stirling numbers of thefirst 
kind is 


OO 

hk(u ) = ^2 c(n, k) 
n=k 


Til 


[— ln(l — u)]* - 
kl 


Finally, we are going to prove a horizontal recursion. Interestingly, the vertical 
generating functionyi(w) will be used in our proof. 


LEMMA 3.36 

For all positive integers k anel n we have 

s(n+l,k + l) = l) m_fc (7) s ( n,m ) - 


PROOF Differentiating our generating function /i/yj (after shifting indices by 
one), we get 


f; .(»+!,*+1)^ = 


n=k 


ki 


= exp(— ln(l + u )) 


[ln(l + u)] k 
kl 


£ 

m=k 


(—l) m [ln(l + u)\ m [ln(l + n)] k 


ml 


kl 
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= E 


(—l) m ~ fc [ln(l + u)] m 

(m — k)\kl 


Note that f h (u) itself is very similar to the last member of the last expression. This 
suggests that we further rearrange the last member as follows. 


J2 s (n + l,k+l)^ = k ['l j 


n=k 


m=k 

co 

= E 

n =k 


, U 

n! 


EM) 


m—k 


m 


s(n,m) 


n\ 


and the result follows by equating the coefficients of - 


3.2.6 An Application: Real Zeros and Probability 

There is a surprisingly strong connection between polynomials having real 
zeros, and random independent trials. It is described by the following theorem 
of P.Levy. 

THEOREM 3.37 

[Levy’s theorem] Let («o, «i, ■■■, a n ) be a sequence of non-negative real numbers, let 
A(z) = ff—Q CLk,z k and assume that A(1)>0. Then tlw following are equivalent. 

1. (i) The polynomial A (z) is either constant or has real zeros only; 

2. (ii) The sequence is the distribution ofthe number d n of successes in n independent 
trials with probability pi of success on tlw ith trial,for some sequence of probabilities 
Q=p=l. Tlw roots ofA(z) are given by -(l-pfpfor i withpfO. 

Note that the probability of success at trial i has to be independent of the result 
of all other trials, for all i. 

Example 3.38 

Let a, = ("). Then A(z)=( l+z)“, so A(z) indeed has real zeros only. For all i, let 
the ith trial be the flipping of a fair coin, and let heads be considered success. 
As the coin is fair, pr=H2 for all i. Then a,k/A( 1) = (”)/2” is indeed the 
probability of having exactly k successes. The roots of A(z) are all equal to -(1- 
pl)lp,=-l. □ 

This theorem shows that we can deepen our understanding of a given sequence 
having real zeros by finding its probabilistic interpretation. We would like to find 
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a probabilistic interpretation for the signless Stirling numbers of the first kind. 
That is, we want to find a sequence of n independent trials in which the probability 
of having k successes is c(n, k )/n\. 

Consider the following sequence of trials. We have one orange ball in a box. 
Let us pick a ball at random, (at the first trial, we have only one choice), then let 
us put this ball back to the box, along with a blue ball. Keep repeating this 
procedure n times. A success is when we draw an orange ball. Let p t be the 
probability of k successes in our n trials. 

Let Ajbe the event that we choose an orange ball at trial j. Then I\Af = 1, and 
we are going to compute F\A~j[. At the beginning of trial j, there is one orange ball 
and (/T) blue balls in the box, showing that 


1 + (j - 1) j ' 

Consequently, the probability of failure at trial j is P[Aj\ = ■ L f-. As our n trials 
are independent, the probability of k successes in n trials is computed as 

* = £ P\A h ]P[A j2 \ ■ • ■ P[A jh ]P[A jk+1 ] ■ ■ ■ P[A jn ], 

where the sum is taken over the family A of all possible (w-A)-element subsets 
(/i+i, j k+ 2, ,j n } of the set (2, 3, ■■■, n }. (There is no chance of failure on the first 
trial.) Using the probabilities computed above, this yields 

dk = \ % y ' K jk+1 ~ l)(jk+2 — 1 ) • • ‘ (jn ~ 1)- 

n! ^ 



Exercise 35 shows that the value of the sum above is c(n, k), implying that 


Pk 


c(n, k) 
n\ 


Note that while Levy’s theorem is often a useful tool to prove the real zeros 
property, it is relatively easy to make a mistake while doing so. See Exercise 27 
for a correct application of this method, and see Exercise 61 for a caveat. 


3.3 Cycle Decomposition versus Linear Order 
3.3.1 The Transition Lemma 

We have seen two different ways of looking at permutations. One considered 
permutations as linear orderings of [n] and denoted them by specifying the order 
p\pi" 'pn of the n elements. The other considered permutations as elements of the 
symmetric group S n and denoted them by parenthesized words that described 
the cycles of the permutations. We have not seen, however, too many 
connections between the two different lines of thinking. For instance, we have not 
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analyzed the connection between the “visually” similar permutations 2417635 
and (2) (41) (7635). Fortunately, such a connection exists, and it is a powerful tool 
in several enumeration problems. This is the content of the following well-known 
lemma. 

LEMMA 3.39 

[Transition Lemma] Letp be an n-permutation written in canonical cycle notation, and letf(p) 
be the n-permutation written in the one-line notation that is obtainedfrom p by omitting all 
parentheses. Then tlw map f: S n —>S n is a bijectixm. 

Applying this bijection to the example of the paragraph preceding the lemma, we 
have/((2) (41) (7635))=2417635. 

PROOF It is clear that / indeed maps into S„. What we have to show is that it has 
an inverse, that is, for each w-permutation q, there exists exactly one p so that 

f(p)=q- 

Let q=qi qr"T, be any w-permutation, and let us look for its possible 
preimages under f. The first cycle of any such permutation p must obviously 
start with q x . Where could this cycle end? As p is in canonical notation, each 
cycle starts with its largest element. Therefore, if qi>q x , then ^ cannot be part of 
the first cycle of p. So if j is the smallest index so that qj>qu then the first cycle 
of p must end in q jA or earlier. On the other hand, we claim that this cycle 
cannot end earlier than in q ]A . Indeed, by the definition of j, we have qT/y if 
1 </£<;, so if the second cycle started somewhere between q x and q p it would 
start with an element less than q x . That would contradict the canonical property 
of q. 

Thus the second cycle of p must start with the leftmost entry cy of q that is 
larger than q x . By identical argument, the third cycle of q must start with the 
leftmost entry of q larger than q p and so on. The procedure will stop with the 
cycle that starts with the entry n. This yields a deterministic, and always 
executable, algorithm to find the unique preimage of q under p. 

In our running example, we have ^=2417635. This yields <y—l. The smallest/so 
that qf>q\ is j= 2, so the second cycle will start in the second position. Then, the 
leftmost entry larger than 4 is 7, so the third cycle starts with 7, and we got the 
permutation (2) (41) (7635). 

Taking a second look at our proof, we note that we can explicidy describe the 
entries of q at which a new cycle is started. These are precisely the elements that 
are larger tlmn everything on tlwir left. This is an important notion. 
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DEFINITION3.40 Let q=q]_qr--q„ be a permutation. We say that q { is a left-to-right 
maximum if, for all k<3, we have qftq,. 

Example 3.41 

The permutation 21546837 has four left-to-right maxima. These are the entries 
2, 5, 6, and 8. [] 

Left-to-right minima, right-to-left maxima, and right-to-left minima are defined 
accordingly. 

COROT.T. ART 3.42 

The number of n-permutations with exactly k left-to-right maxima is c(n, k). 

PROOF Bijcction/of Lemma 3.39 maps the set of w-permutations having exacdy 
k cycles onto this set of permutations. 


3.3.2 Applications of the Transition Lemma 

Lemma 3.39 has several interesting applications to random permutations. We 
will discuss these in Chapter 4. For now, we fulfill an old promise by proving 
Theorem 1.35. For easy reference, that theorem stated that the number of n- 
permutations having exactly k-\ excedances is A(n, k). Also recall that i is an 
excedance of the permutation p=p\pr--p n if pft. Similarly, z is a weak excedance 
of p if p(i)>i. 

PROOF (Of Theorem 1.35). We show that the Injection/of Lemma 3.39 maps 
the set of w-permutations with k weak excedances onto the set of n-permutations 
with k-l ascents. 

The proof will be somewhat surprising. The definition of excedances is given 
in terms of permutations as linear orders, so one would not expect to count 
excedances on permutations that are written in cycle notation. However, this is 
precisely what we will do. 

Let 7r = (pi • • 1 • • -Pif) ■ ■ ■ (pi 3 _ 1+ 1 • • -p-ift be written in canonical 

cycle notation. 

We now apply/to n, and count the ascents. We claim that z’ is an ascent off ft), 
that is, pftpi .fi if and only if ifti and p<7l(p). 

To see the “only if” part, note the following. Unless i is a position at the end 
of a cycle, we have n (p)=p i+1 as each element of a cycle is mapped into the one 
immediately on its right, except for the last one. Therefore pftpM. implies p<dt(p). 
When z’ is a position at the end of a cycle, then /</',+1 always holds because of the 
canonical property, and so does p<,K (p). Indeed, p, is mapped to the first element 
of its cycle, and that is always larger than/ (or equal to p„ if the cycle containing 
pi is a 1-cycle). 
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The proof of the “if” part is similar. If we have p<n(p), then either i is a 
position not at the end of a cycle, or it is at the end of a cycle. In the former case, 
n(p)=p l+u and therefore p;<p I+1 . In the latter case, p(p)=pf\i the cycle containing 
p, is a 1-cycle, wa.Ap,<p l+1 otherwise as p<K(p)<p l+ i. Indeed, p ;is less than the first 
entry of its cycle, and that is in turn less than the first element of the next cycle 
by the canonical property. 

We have shown that the bijectiony' of Lemma 3.39 turns the weak 
excedances of 7T other than winto ascents of f(n). Finally, Exercise 13 shows 
that the number of w-permutations with k weak excedances is the same as the 
number of n-permutations with k -1 excedances, completing the proof of our 
theorem. 

Example 3.43 

Let 7T=(32) (514) (76). Then in the one-line notation, we have 7£=4325176. This 
permutation has four weak excedances, namely at 1, 2, 4, and 6. On the other 
hand, f(n)—i 251476 has indeed three ascents, at 2, 4, and 5. [] 

Note \haif(n) had its ascents at 2,4, and 5. This is where7T has its weak excedances 
(besides n), if we use the notations of the proof, that is, ^(32) (514) 
(76 )=(pip 2 ) (pspsps) (pcpi) ■ In deed, entries 2,1 and 4, that is, the second, fourth and 
fifth entries are mapped into a larger entry. 

Lemma 3.39 can also be used to prove the following surprising fact. We have 
computed in Lemma 3.26 that the total number of cycles in all w! permutations of 
length n, is n\H (n ) = w!(l + | + • ■ ■ + ~). On the other hand, it is not hard to 
prove, either as a special case of Exercise 18 or directly, that similarly, c(n+ 1, 
2 )=n\H(n). In other words, there are as many (ycles in all n-permutations as there are 
(n+ 1) -permutations with exactly two (ycles, if ri> 1. The fact that these two sets are 
equinumerous certainly asks for a bijective proof, and we are going to provide 
one. The crucial idea will be that omitting n+ 1 from its cycle can split that cycle 
up into many smaller cycles. 

Recall the map f: .S’„—>.S’„ of the Transition Lemma. This map was defined on 
permutations of length n. With a slight abuse of notation, we will also use the 
notation / for the analogous map defined on any S m . This is because we will use 
f on a permutation, and then on some of its parts. 

Let 

A = {p € S n+ i\p has two cycles} 

and let 

B = {(q, C)\q € S n and C is a cycle of q.} 

Define g: A—>B as follows. Set E to be the cycle of p that does not contain the 
entry n+1. Let D to be the other cycle of p, the one that contains n+ 1. Then we 
can consider das a permutation, just it is not a permutation of [n], but of some 
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other set S. Apply /'to D (that is, omit the parentheses), then omit n +1 from the 
resulting permutation of S, to get the new permutation D ’. Now apply/ back to 
D’t o get a permutation / of 5-{w+l} that is written in canonical cycle notation. 
Then we define q=Eq’, that is, q is the product of the disjoint cycles E and q’. 
Finally, we set g(p)=(q, E). 

Example 3.44 

Let «= 8 , and let/=(4231)(97586). Then _E=(4231), and D=(97586). Therefore, 
D— 7586, so q— (75)(86). This yields ^=(4231)(75)(86), so we have g(p)=(q, 
E)=(( 4231)(75)(86), (4231)). Q 

THEOREM 3.45 

The map g: A—>B is a bisection. 

PROOF We prove that ghas an inverse. That is, we show that for any (q,C) G B, 
there is exacdy one p £ A so that g(p)=(q, C) holds. 

The crucial idea is that we can use n +1 to “glue” together all cycles except for 
Cinto one cycle. Let Q, C 2 , C t be the cycles of q in canonical order, with C=C. 
To get the preimage of the pair (q, C) under g, omit all parentheses from the 
string QQ-Qi C I+ , C h , then prepend the obtained permutation with the entry 
n+ 1, to get the single cycle H. Then define p to be product of the two disjoint 
cycles C and H. This shows that (q, C) has a preimage. (Note that if q had only 
one cycle, then the list C 1 Q---Q. 1 Q 4 . 1 -CA is empty, and we just create a 1 -cycle 
consisting of the entry n +1 only.) 

To see that (q, C) cannot have more than one preimage is straightforward. 
Indeed, a change in either cycle of / clearly results in a change of /// | 

Example 3.46 

To find the preimage of the pair (q, Q=((4231)(75)(86), (4231)) under g we 
prepend the permutation 7586 by the maximum element 9. This leads to the 
cycle (97586), yielding g 1 (q, Q=(4231) (97586) as we expected because of Example 
3.44. Q 


3.4 Permutations with restricted cycle structure 
3.4.1 The exponential formula 

We start with a very brief introduction to the main enumerative tool of this 
section. Many of the readers may have seen this material in a general combi¬ 
natorics textbook. Other readers may consult Chapter 8 of [27] for a longer 
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treatment on an introductory level, or Chapter 5 of [180] for a discussion at an 
advanced level. 

Recall that the exponential generating function of the sequence Oo, «i, a ••• of 
real numbers is the formal power series 

A{x) = Y j a n —. 

n> 0 

It is not always necessary to require that the a, be real numbers. In general, it 
suffices to assume that they are elements of a field of characteristic zero. 

LEMMA 3.47 Productformula 

Letf(n) be the number of ways to build a certain structure on an n-element set, and let g(n) 
be the number of ways to build another kind of structure on an n-element set. LetF(x) and G(x) 
be the exponential generatingfunctions of the sequencesf(n) andg(n),for n=0 ,1, 2, Finally, 
let h(n) be the number ofways to split an n-element set into two subsets, then to put a structure 
ofthefirst kind on thefirst subset, and to put a structure ofthe second kind on the second subset. 
Let H(x) be the exponential generatingfunction of the sequence h(n),forn=Q, 1, 2, ■■■. Then 
we have 


F(x)G(x)=H(x). (3.5) 

PROOF The number of ways to split an w-element set into a ^-element set and 
its complement, then build the above structures on these sets is obviously 
( 2)f(k)g(n — k). Summing over all k, we get that 


K n ) = “ k )■ 

k=0 ' ' 


(3.6) 


So our lemma will be proved if we can show that the expression on the righthand 
side of (3.6) is in fact the coefficient of x"/n\ in G(x)F(x). This is simply a question 
of computation as 


F{x)G(x) 


_ rpLl _ rpH 

n> 0 n>0 

Z Z whk)< !{k)ain ~ k) 

n>0 k =0 v ' 

zSz(i)/«»(-^ 

n >0 k =0 v ' 


I 
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We have two remarks about the two subsets into which we have partitioned 
our w-element set. First, the partition is ordered, that is {1, 3}, {2, 4, 5} is not 
the same partition as {2, 4, 5}, {1, 3}. Indeed, in the first case, we build a 
structure of the first kind on the block {1, 3}, and in the second case we build 
a structure of the second kind on that block. Second, empty blocks arc permitted. 
This is why the generating functions F(x) and G(x) start with a constant term. 
In other words, our two subsets form a weak ordered partition of our w-element 
set. 

Iterated applications of Lemma 3.47 lead to the following. 

COROLLARY 3.48 

Let k<n be positive integers. For i g [jfc], letf(n) be the number ofways to build a structure of 
a certain kind i on an n-element set, and let Ffx) be the exponential generatingfundion of the 
sequence f(n). Finally, let h(n) be the number of ways to split an n-element set into an ordered 
list of blocks {B u B 2 , —, Bfi then build a structure of kind i on B^for each i £ [M IfH(x) is 
the exponential generatingfundion of the sequence h(n), then we have 

H(x)=Ffx)F 2 (x)-F k (x). 

Finally, note that it was not particularly important in our proof that the numbers 
f(n) were integers, or that they counted the number of certain structures. What 
was important was the relation betweenyjw,) and h(n). This leads to the following 
generalization. 

THEOREM 3.49 

Let K be a field of characteristic zero, and letfi: N—>K be functions, 1 <i<k. Define h: 
N—>Kby 


h(n) = '£ / ff\A 1 \)f 2 (\A 2 \)...f k (\A k \), 

where the sum ranges over all weak ordered partitions (A h d-2, ■, A k ) of [n] into k parts. Let 
Ffx) andH(x) be the exponentialgeneratingfunctions of the sequences f(n) and fin). Then we 
have 


H(x)=F 1 (x)F 2 (x) ■ ■ -Fn (x). 

You could ask what is the advantage of letting / and h map into any field of 
characteristic zero, as opposed to just the field of real or complex numbers? The 
answer will become obvious when we discuss cycle indices later in this section. 
For the time being, we just mention that K will often be chosen to be a ring of 
polynomials. 

Theorem 3.49 yields an immediate second proof of Corollary 3.35. For easy 
reference, that corollary claimed that the exponential generating function of the 
signless Stirling numbers of the first kind is 
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OO 

hk{u) = ^2 c ( n >k) 

n=k 


ni 


[- ln(l — u)] k 

k\ 


Indeed, h k (u) is just the exponential generating function of the numbers c(n, k). 
Note that k\c(n, k) is the number of ways to choose an w-permutation with k- 
cycles, then order its set of cycles. However, this is the same as partitioning [n] 
into k nonempty subsets (A h A 2 , —, Af then taking a 1-cycle on each of the A;. 
Therefore, with the notations of Theorem 3.49, we ha.we.f(k)=(k-\) \ for each i, 
with/(0)=0. This yields A; (u) = E* i T = Tn(l-u) for each i, and our claim is 
proved. 

Our main tool in this section, the Exponential formula, will be stated and 
proved shortly, using Theorem 3.49. First, however, we have to take some 
precautions. We want to work with power series of the type exp (F(x)), where F(x) 
is some formal power series. 

A little thought shows that exp (F(x))=l+F(x)+F(x) 2 l2+--- is defined if and only 
if A(0)=0, or in other words, when Alias no constant term. Indeed, it is precisely 
in this case that the coefficient of any x m in 1 + F(x) + 
F(x) 2 /2 + ■■■ = YYY-o w iH be a finite sum. Therefore, we to make sure 
we only talk about exp (F(x)) when we know that Ahas no constant term. 

Having settled that, we are ready for the exponential formula. Let P denote 
the set of all positive integers. 

THEOREM 3.50 

[The Exponential Formula] Let K be a field of characteristic zero, and let f: P —>K be a 
function. Define h: N—>iv by A(0)=1, and by 

h(n) = £/(|A 1 |)-/(|A 2 |) ./(|A m |), (3.7) 

for n> 0, where the sum ranges over all partitions (A h A 2 ,-,A n ) of [n] into any number of 
parts. LetF(x) andH(x) be the exponential generatingfunctions of the sequencesf(n) andh(n). 
Then we have 


H(x)=ex p (F(x)). 

In particular, if f(n) is the number of ways one can build a structure of a certain 
kind on an n-element set, then h(n) is the number of ways to take a set partition 
of an w-element set, and then to build a structure of that same kind on each of the 
blocks. 

Note that in contrast with the product formula, here we take a partition of our 
n-element set. This means that the set of blocks is not ordered, that is, {1, 3}, {2, 4, 
5}, and {2,4,5}, {1,3} are considered identical, and empty blocks are woipermitted. 
There is no restriction on the number of the blocks, unlike in Lemma 3.47 and in 
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Corollary 3.48. This is why we have to exclude empty blocks; otherwise the 
number of our partitions would be infinite. 


PROOF (of Theorem 3.50) Let 


h k {n) = Y J f(\Ai\)f{\A 2 \)---f(\A k \) 


where the sum ranges over all partitions of [n] into k parts, for a fixed k. 

Now that the number of blocks is fixed, we can use Corollary 3.48, (with 
f[ 0)=0), keeping in mind that in that corollary, the set of blocks is ordered. We 
get 


hk(n) = ■'■ffi.rf. 


Summing over all k, we get what was to be proved. | 

We point out that there is a stronger version of this theorem, the Compositional 
formula, but we will not need that here. Interested readers should check Exercise 
48. 

Example 3.51 

Let A(0)=1, and let h(n) be the number of ways to take a set partition of [n] and 
then take a subset of each block. Then we have 



n>0 


□ 


PROOF Let f(n) be the number of ways to choose a subset of a nonempty set. 
Then/(n)=2” for wX), andy(0)=0. Therefore, the exponential generating function 
of the sequenceyiw) is F(x)=ex. p(2x)-l, and the proof follows from the exponential 
formula. 

COROLLARY3.52 

[Exponentialformula, permutation version] LetK be afield of characteristic 0, and letf: P—>iv 
be any function. Define a new function h by 


Hn)= £ f(\C 1 \)f(\C 2 \)---f(\C k \), 


(3.8) 


pES, 
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where the C, are the cycles of p, anel |C, | denotes the length of Q. Let F(x) and H(x) be the 
exponential generatingfundions off and h. Then we have 


H ( x ) = exp 



PROOF Note the subtle difference between the summation on the righthand 
side of (3.8) and the summation in (3.7). The former is a summation over all n\ 
elements of S„, the latter is a summation over all set partitions of [n], As a k- 
element block of a partition can form (A-1)! different Ccycles, a partition with 
blocks B u B 2 , —, B h gives rise to (|2Jj| — 1)! permutations whose cycles are 
the blocks B r Therefore, (3.8) can i>e rearranged as 

h(n)= Y, /(I-^iDG-®! I — l)!/(l-® 2 |)(j-B 2 | - 1)! • • ■ f(\Bk\){\Bk\ - 1)1 
Be n„ 


Then the exponential formula (applied to (n-l)f(n) instead of f(n)) implies 


H(x) = exp((n - 1)! • F(x)) = exp 



which was to be proved. | 

The following application of Corollary 3.52 enables us to count permutations 
that consist of cycles of certain lengths only. 

THEOREM 3.53 

Let C be any set of positive integers, and let g c (n) be the number of permutations of length n 
whose <ycle lengths are all elements of C. Then we have 

Gc(f = Y 9 c ^f\ = exp ( H ff 

n> 0 * \n(zC 



PROOF Let f(n)={n-V)\ if n G C an< i let f(n)={) otherwise. Theny gives the 
number of ways we can cover a set of n elements by a cycle of an acceptable 
length. Then F{x) = f( n )lP = TT' ® U1 c ^ a i m the 11 follows from 

the Exponential formula. | 

An involution is a permutation^ so that p- l =p, in other words, jf=\. It is easy to 
see that this happens if and only if all cycles of p have length 1 or 2. Note that 1- 
cycles are often called fixed points. 
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Example 3.54 

Let G 2 (x) be the exponential generating function for fixed point-free involutions. 
Then we have G 2 (x)=exp [PIT], [] 


COROLLARY 3.55 

The number offixedpoint-free involutions of length 2n is h(n) = 2 ^nT = : l‘3---(2re-l). 


PROOF By Example 3 .54, we only have to compute the coefficient gi (n) of 
in G 2 (xj=exp(x 2 /2). We have 


exp(a; 2 /2) 


E 

n>0 


X 2 " 

2 n -nV 


so 


9z(n) 


(2n)l 

2 n • n! 


1 ■ 3 ■ ■ ■ (2n — 1) = (2n — 1)!!. 


■ 


The symbol (2w-l)!! reads “(2n-l) semifactorial”, referring to the fact that we 
take the product of every other integer from 1 to 2n-l. 

Note that G 2 (x) does not contain terms with odd exponents. This makes perfect 
sense as a fixed point-free involution cannot be of odd length. 

In what follows, we are going to look at some interesting enumerative results 
for some particular choices of the set C. Many of these results could be obtained 
by some clever combinatorial arguments and without the use of generating 
functions. We, however, prefer to present the general technique of generating 
functions, and leave the bijective proofs for the exercises. 

Recall that integrating the identity 1/(1 — x) = y/ n>0 x n we get the identity 


— ln(l — x) 


(In {l-x)T l = Y, 

n> o 


x n+l 

n + 1 


E 


n 


(3.9) 


The following example introduces a widely researched class of permutations. 

Example 3.56 

Let D(n) be the number of fixed-point free w-permutations (or derangements). Then 
we have D(x) = J2 n > 0 D ( n )~r = □ 

PROOF In this case, we choose C to be the set of all positive integers 
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D{x) 


G c {x) = exp 



= exp((ln(l — x)) 1 — x) = 


exp(— x) 
1 - x 


I 


Routine expansion of the last term yields the following formula for the number 
of derangements of [n], 

COROLLARY 3.57 

LetD(n) be the number ofderangements of length n. Then we have 

n 

D(n)=n!£(-ir-. 

i=0 

Note that by this formula, we have (D (n)/n!) —> i, so more than one third of all 
permutations will be derangements. 

One may be interested in finding the number of all elements of the symmetric 
group S„ whose order is odd. These are the elements with odd cycles only. This 
is the motivation of the following example. 


Example 3.58 

Let Cbe the set of all odd positive integers. Then we have G(;(x) 

□ 



PROOF By Theorem 3.53, we need to compute 


G c (x) = exp ( x + ~ H-) = exp [ y 


r 2n+l 


t n>0 


(2n + 1)! / ‘ 


Now note that taking derivatives, we have 


„2n+l 


y ^— =y 

^ 2n + 1 Z -' 


1 


1 1 
+ 


k n>0 


n>0 


1 — x 2 2(1 — x) 2(1 + x) 


Therefore, 


._, r 2n+L 1 

^2^T = 2 (ln(1 “ a:r3+ln(1+S)) ’ 

n>0 
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and so 


G c {x) = exp ( -(h^i 


•*))' 


_1 +ln(l + x) X ) = 


1 + x 


which was to be proved. | 

COROLLARY3.59 

For all positive integers n, the number of permutations of length 2 n that have odd cycles only is 
ODD (2w)=(l-3-5-"(2w-l)) 2 =(2w-l) !! 2 . Similarly, the number of permutations of length 2w+l 
that have odd cycles only is ODD (2w+l)=(l-3-5-"(2w-l)) 2 (2w+l)=(2n-l)!! 2 (2w+l). 

This remarkable result has several proofs. A combinatorial one for a more general 
version is given in Problem Plus 55. 


PROOF Using the result of Example 3.58, all we have to do is find the coefficients 
of x m /m\ in Gc{x) = Multiplying both the numerator and the 

denominator by -|- x, we get 

G{x) = Trv?- (310) 


By the binomial theorem, we have 


(l-x 2 )- 1 ' 2 



(3.11) 


= ( i) m tlB. • (~ 3 / 2 ) ■ - ■ (- ( ' 2m - 


m>0 


= £ 


ml 


(2m - l)!!_^ 2m 


m> 0 


ml ■ 2 71 


(3.12) 

(3.13) 


Note that (l-x 2 )' 1 ® has no terms of odd degree. 

This shows that the coefficient g c (2m) of x 2m /(2m)\ in our generating function 
G C {X) = (l + x)Zm>0 

(2m)! ■ = (2m - l)!! 2 , 

v ’ ml • 2™ v J 

while the coefficient g c (2m+l) of x 2 ” +1 /(2m+l)f is 


(2m + 1)! (2m — 1)!! 


m\ 


2™ 


= (2m - l)!! 2 (2m + 1), 
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As a natural counterpart of permutations with odd cycle lengths only, let us now 
look at permutations of length n that have cycles of even lengths only. 

Example 3.60 

Let Cbe the set of all even positive integers. Then we have Gc(x) = \J •• 

□ 

PROOF Theorem 3.53 implies 
G c {x) 


„2 n 




i n> 1 


(3.14) 


Note that the argument of the exponential function on the right hand side is very 
similar to (3.9), with x 2 playing the role of x. Therefore, (3.14) implies 

G c {x) = exp i i(ln(l - x 2 ))" 1 ^ = 


as claimed. 

COROLLARY3.61 

For all positive integers m, the number of ( 2ni)-permutations tluit luive even cycles only is EV 
EN\2m)={2m-l)U 2 . 

PROOF Using the r esult o f Example 3.60, all we have to do is find the coefficients 
of x 2m /(2m)\ in y —-x 1 • We have computed this power series in (3.11), and 
saw that the coefficient of x lm l(2m)\ was indeed (2m-l)!! 2 . 

This leaves us with the intriguing observation that ODD(2m)=EVEN(2m) for 
all positive integers m. This fact asks for a bijective proof. One is given in 
Exercise 53. 

In the above examples, we have had conditions on the cycle lengths based on 
their parity. One could just as well compute the generating function for the 
number of w-permutations with no cycle lengths divisible by k, or all cycle lengths 
divisible by k. See Exercises 5 0 and 5 5 for generating function proofs, and Problem 
Plus 4 for combinatorial proofs. 
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3.4.2 The cycle index and its applications 

We would like to refine our permutation counting techniques. Theorem 3.53 
allows us to count permutations with a given set of cycle lengths. However, if we 
want to count permutations that have, say, an even number of fixed points, or an 
odd number of 2-cycles, then we cannot apply Theorem 3.53 directly. 

In order to be able to handle complex situations like these, we introduce extra 
variables. Let p be an w-permutation that has a, cycles of length i. We then associate 
the monomial II" =1 jf“‘ to p. For example, if/=(312) (54) (6), then we associate 
the monomial 444 to p, whereas if/=(21) (3) (54), then we associate the monomial 
htl to p. 

Summing all these monomials for all w-permutations, we get the augmented cycle 
index Z(S n ) of the symmetric group S„. 

That is 

Z(S n ) = Z(S n )(t 1 ,t2,---,tn)= t? 1 

p6S n 

where a, is the number of z-cycles of p. We note that the cycle index Z(Sj of S„ is 
defined by Z(S n ) = jy_Z{S n ). 

Example 3.62 

For n= 3, we have 

Z(Sn) = fi+ 3M 2 + 3%. 


□ 


The reason the augmented cycle index is a useful tool for the enumeration of 
permutations is the following observation. 


PROPOSITIONS. 63 

Let Z(S n ) be defined as above. Then we have 


E i)^T = exp 

' n! 

n>0 



PROOF Recall that a rationalfimctixm over a field is just the ratio of two polynomials 
over that field. Let iv be the field of all rational functions in variables t h 4, ■■■, 4 
over Q. Define/: VsKhy f(n)=l lr Then 

Z(S«)= E f(\Ci\)f(\C 2 \)---f(\C k \), 

pes n 

where the C t are the cycles of p. Our claim is then immediate from Corollary 
3.52. | 
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You may wonder how introducing n new variables and obtaining one new 
equation will help us. As you will see from the upcoming examples, the strength 
of Proposition 3.63 is its generality; we can choose any values for the 4 and the 
equation will still hold. 

Example 3.64 

Let g(n) be the number of n-permutations that have an even number of fixed 
points, and let G(x) = o 9 (n)x n /nl.. Then we have 



□ 


Before we start our proof, we remind the reader to the identity cosh(x) = 
(exp a; + exp(-a;))/2 = En>o (W In other words, cosh x is obtained by 
omitting the odd terms from exp x. 

PROOF The crucial observation is that 


Z{S n )(l, 1, - • •, 1) + Z(S n )(—l, 1, • • •, 1) = g{n) 


Applying Proposition 3.63 we get 



x n exp(ai) + exp(— x) 


,, x exp(-;r) 

exp = coshla;) ■ —- 

' 1 — x 



In the last step, we used the result of Example 3.56. 


Example 3.65 

Let h(n) be the number of n-pcrmulations that have an odd number of 2-cycles 
and no fixed points. Then 



□ 


PROOF Note that 


Z(S ra )(0,1,1, • • •, 1) - Z(S n )( 0, -1,1, - • •, 1) = h{n) 


2 
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The claim then follows just as in the previous example. | 

Let p be an w-permutation. We say that p has a square root if there exists another 
n-pcrmutation q for which f=p. Note that p can have many square roots. For 
instance, all involutions are square roots of the identity permutation. Whether p 
has a square root or not is easy to tell from its cycle decomposition. 

LEMMA 3.66 

The permutation p has a square root if and only if its unique decomposition into the product of 
distint cycles contains an even number of cycles of each even length. 

Example 3.67 

The permutation (21) (64) (753) has a square root as it has two cycles of length 
two, and zero cycles of any other even length. The permutation (4231) does not 
have a square root as it has add odd number (one) of cycles of length four. 

□ 

PROOF (of Lemma 3.66) Assume p=P. When we take the square of r, the odd 
cycles of r remain odd cycles. The even cycles of r will split into two cycles of the 
same length. Therefore, the even cycles of r 2 come in pairs, proving the “only if” 
part of our assertion. 

To see the “if part”, assume that p has even cycles ( ay-ap and (by-bp. These 
even cycles can be obtained by taking the square of the (2A)-cycle (ap x af) 1 an l —a l })p. 
Odd cycles oi p, such as (did 3 d 5 ---d J fd i ---dj. 1 ) can be obtained as the square of 
(dif-'-d). So we can find roots for all cycles of p this way, then we can take the 
product of the obtained cycles to be the permutation r. | 

As we characterized permutations having square roots by their cycle lengths, 
we can use Theorem 3.53 to find the exponential generating function for their 
numbers. 

THEOREM 3.68 

Let SQfn) be the number of n-permutations that have a square root, and let 
SQ(x) = SQ n\^ xU - ^ len we '' mve 

i> 1 

PROOF Recall that our permutations must have an even number of 
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cycles of each even length. Therefore, repeated application of the method seen in 
the proof of Example 3.64 yields 

SQs&Mexp x)-(cosh(*?/2))-(exp x 3 / 3 )-(cosh(x 4 /4))—. (3.15) 

Our claim is then proved recalling that we have already computed the power 
series exp in Example 3.58. | 


COROLLARY 3.69 


For all positive integer n, we have SQf2n)-(2n+l)=SQ(2n+l). 


This is an interesting result. It means that when we pass from 2 n to 2w+l, the 
number of permutations with square roots grows just as fast as the number of all 
permutations. 


PROOF It suffices to show that the coefficients of x 2 ” and x 2 " +1 in SQfx) are 
identical. As //)> t cosh^'/Qi)) does not contain terms with odd exponents, it suffices 


to show this for 



. Recall again that we have seen in Example 3.58 that 



Y.ODD(nff 


n> 1 


Corollary 3.59 then shows that ODD(2n+l)=(2n+l) 0DD(2n), which is just what 
we needed. | 

In other words, the probability that a randomly chosen 2w-permutation has a 
square root is equal to the probability that a randomly chosen (2 n+ l)-permutation 
has a square root. This nice identity certainly asks for a combinatorial proof. 
One is given in Exercise 59. Not surprisingly, it builds on the combinatorial proof 
of the equality 0DD(2n+l)=(2n+l) 0DD(2n). 


3.4.2.1 Proving the formula of Schlomilch 

We return to the task of proving the formula of Schlomilch. This is a somewhat 
complicated computational argument. In order to prevent our proof from becoming 
tedious, we have left some technical steps as exercises. The reader may try to fill 
in these gaps, or may check them in the Solutions to Exercises section. 

First, and this is interesting, we will need the Lagrange Inversion Formula. It is 
remarkable that a simple problem as the one at hand calls for this advanced 
technique. 

Let f and g be two formal power series in YL[[x]]. We say that g is the 
(compositional) inverse of fiif(g(x))=x holds. One can prove that in that case, 
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g(f( x ))=x also holds. It is also straightforward to check that if/has an inverse, then 
this inverse is unique. We will denote the inverse of /'by/'T Finally, we point 
out that/"has an inverse if and only if/0)=0, that is, when/has no constant 
term. 

If / G R[[ar]] is a formal power series, then let [d]f denote the coefficient of xf 
in/ 


LEMMA 3.70 

[Lagrange Inversion Formula] Let f £ R[[x]], and let k and n be positive integers satisfying 
\<k< n. Then we have 


[x a ]{ fc- 1 ^ = 




(3.16) 


The Lagrange Inversion Formula has several interesting proofs, among which we 
can find combinatorial, algebraic, and analytic arguments. See [180] for one proof 
of each kind. The original argument of Lagrange actually proves a stronger 
statement in that k and n are only required to be integers. Gilbert Labelle el al. 
found combinatorial proofs for various versions of the Lagrange Inversion formula, 
including multivariate generalizations. See [43], [106] and [142] for these results. 

Now we are in a position to prove the formula of Schlomilch. 


THEOREM 3.71 

For all positive integers k and n satisfying k<n, we have 

s(n, k) — £ (-1 ) h ( n ~l +h i)( 2n l\ 

d—* \n — k + hi \n — k — h 


0<h<n—k 


S(n — k + h,h) 


E 


0 <i<h<n—k 


h\ (n — 1 + h 


2 n — k 


n — k + hj \n — k — h J 




n~k-\-h 


hi 


PROOF First note that the result of Lemma 3.34 provides a formal power series 
whose coefficients are very close to the numbers s(n, h). Indeed, 

k'-s(n,k) = [u”](ln(l +u)) k . 


n\ 


Then we can apply the Lagrange Inversion formula for the power series/(w)= 
(exp li )-1 and/ :l> (M / )=/w(l+M). We get 




k 

n 


ni n 

n—k 




U 


= -. „T" -‘lyi-p-Lr: 

h=0 


n 


n + h\ h 


x h . 
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The last step is certainly counterintuitive. We have made a simple expression a 
lot more complicated. The reason for this is that now we can recognize S(n-k+h, 
h) in the last term. Indeed, it is proved in Exercise 29 that 

OO yi I 


Therefore, 


l. n-fei ( e " ~ J \ x h _ h'-Sjn ~ k + h, h) 

1 u ) X ~ (n-k + ti)\ ’ 

and the first equality of the formula of Schlomilch is proved by routine 
cancellations. One then obtains the second equality by substituting the exact 
formula for S(n-k+h, h) given in Lemma 1.16. 


Exercises 

1. Prove that A„ has nil 2 elements for ri>2. 

2. Find a subgroup of >.S’„ that has (n-l )! elements. 

3. For p g S n ,, prove that p and p * l 2 3 4 5 6 7 8 9 10 have the same number of inversions. 

4. For p,q £ Sn, prove that the number of fixed points of pq and qp is the 
same. Try to find a solution using linear algebra. 

5. Find a surjective homomorphism f: S„—>S 2 

6. (Basic knowledge of Linear Algebra required). It is routine to check that the 
map f(p)=A p mapping each element of S n to a permutation matrix is a 
homomorphism. Is this homomorphism irreducible? That is, is there a 
nonzero w-dimensional vector v (say, with real coefficients) for which Ap(v)=v 
for all p £ S n ? 

7. Let ?z>3. Assume we know that f £ S n is such that fg=gf for all g £ S n - 
What can/bc? 

8. Let nbc an even positive integer. Prove that at least half of all ^permutations 
contain a cycle of length at least n/2. 

9. Let L(n, k) be the number of //-permutations whose longest cycle is of 
length k. Assume that j < k < n.. Find a formula for L(n, k). 

10. Let L(n, k) be defined as in the previous exercise. 

(a) Let < k < j. Find a formula for L(n, h). 
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11 . 


12 . 

13. 

14. 

15. 

16. 

17. 

18. 


19. 

20 . 


21 . 


22 . 


(b) Now let \<Lk<n. Find a formula for L(n, k). 

An adjacent transposition is a transposition interchanging two consecutive entries 
in a permutation, that is, it is a cycle of the form (i z+1). Prove that any 
element of S„ can be obtained as a product of (not necessarily distinct) 
adjacent transpositions. 

Let p £ S n be a permutation that can be obtained as a product of rn 
transpositions and also as a product of k transpositions. What can be said 
about m+k ? 

Let n>l. Find all values of n for which S„ does not have two conjugacy 
classes of the same size. 

Prove that any even permutation of S n can be obtained as a product of (not 
necessarily distinct) 3-cycles. 

Prove formula (3.1). 

Let 7'bc a tree on vertex set [n], with degree sequence d h d 2 , d n . Prove 
that the number of all cyclic permutations of [n] that are generated by the 
transpositions defined by the edges of Tis D(lj= dp.dp. ■■■ d n \. 

Find a simple closed formula for c(n, 1) and c(n, n- 1). 

Prove, preferably combinatorially, that 


z{n, k) 


n\ 

kl 


£ 

ri+r-i -1-bi>=-n 


nr 2 ■ ■■r k 


where the r, are positive integers. 

Prove a formula for S(n, k) that is similar to that of the previous exercise. 
+Give a generating function proof of the identity 

n 

s(n + 1 , k + 1 ) = £ (—l)" _m (n)n-m • s(m, k). ( 3 . 17 ) 

m=k 

Define a k (n)=S(n+k, n) and b k (n)=c(n, n-k). Prove that for any fixed positive 
integer k, both a k (n) and bfn) are polynomial functions of n. What is their 
degree and leading coefficient? 

Let afn) and bfn) be defined as in the previous exercise. We have just 
proved that these functions are polynomials of n. We can therefore substitute 
any real number into these polynomials, not just positive integers. Prove 
that for all positive integers k and all integers n, we have 
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a k (G)-a k (-l)=-=a k (-k)=0, 


(b) 


a t (-n)=b t (n). 


23. Give a combinatorial proof of the identity 


n 


c(n + 1, k + 1) = ^ ( n) m c(n , m), 


rn —0 


where we set (n) 0 =1 . 

24. Let C 2 (n, k) be the number of w-permutations with k cycles in which each cycle 
length is at least two. Find a recursive formula for the numbers cy(n, k). 

25. Let Co (n, k) be defined in a way analogous to the previous exercise. Find a 
recursive formula for the numbers (%(n, k). 

26. Find a closed form for the generating function 



n>0 k—0 


27. A box originally contains m white balls. We run a multiple step experiment 
as follows. At each step, we draw one ball from the box at random. If the 
drawn ball is white, we put a black ball in the box instead of that white ball. 
If the drawn ball is black, we put it back to the box. For k<n, compute the 
probability of drawing k white balls in n trials. What is the connection 
between these probabilities p(k, n), and the Stirling numbers of the second 


kind? 

28. Prove that S(n,k ) = 1 2k<m<n &(' m ~ b & ~ 1 )k n ~ m . 

29. Find the exponential generating function tp-(w) = k)u n /nl 

30. Find a simple combinatorial definition for the numbers B(n, k) so that we 


have 



k<m<n 


Then find a formula for the numbers R(n, k). 

31. Let p be a prime number. Prove that c(p, k) is divisible by p unless h=p, or 


k=l. 
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32. (Wilson’s theorem.) Let p be a prime. Prove that l+(/>-l )! is divisible by p. 

33. Let p be a prime. Prove that S(n, k) is divisible by p, unless k=p or k=l. 

34. (a) Prove that 


E 

l<ii <«2 <•••<*«— 


hi 2 ■ • • i n -k - S(n, k). 


(b) Prove that 

Yl (hh-'-in-k ) 1 = S(n,k,l), 

l<ii<i2<"-<i n -k<k 


where S(n, k, l) is the /-Stirling number of the second kind as defined 
in Problem Plus 3 of Chapter 1 . 

35. Prove that 


i\H ■ ■ • i n -k = c(n, k). 

I<il<i 2 <-<i n -k<n—l 


36. Prove (preferably with a combinatorial argument) that for fixed positive 
integers n and k satisfying k<n, we have 


n 

c(n,m)S(m,k ) 

m=k 


n! /n — 1\ 

J\\k- l )' 


37. Give a proof of Corollary 3.57 without the use of generating functions. 

38. Prove that D(n) (the number of derangements of length n) is the integer 
closest to n\/e. 

39. Which set is larger, the set of all derangements of length n, or the set of all 
w-permutations with exactly one fixed point? By how much? 

40. Give a combinatorial proof (no generating functions, or recursive relations) 
for your answer to the previous exercise. 

41. Let p be an w-permutation, and let aa(p) be the the smallest of all the ascents 
of p, if p has any ascents, and let aa(p)=n otherwise. A disarrangement is a 
permutation for which aa(n) is even. For example, 213 and 54312 are 
desarrangements. Let J(n) be the number of desarrangements of length n. 

(a) Prove that J(n)=nJ(n-l)+(-l) n . 
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(b) Prove that J(n)=D(n). 

42. Find a combinatorial proof for the identity J(n)=D(n), where J(n) is defined 
in the previous exercise. 

43. Prove combinatorially that D(n)=(n-l)(p(n-l)+D(n-2)). 

44. Prove that for any fixed n, the sequence {S(n, is log-concave. (Be 

careful.) 

45. Prove that 


det Cc = det 


( c(n + 1,1) c(n + 1,2) • • • c(n + 1, k) N 

c(n + 2,1) c(n + 2,2) • ■ ■ c(n + 2, k ) 

^c(n + k, 1) c(n + &, 2) • • • c(n + k,k)) 


= (n!) 


n* 


46. How many w-permutations have exactly three left-to-right maxima and 
exactly one left-to-right minimum? 

47. Give a combinatorial proof for the result of Corollary 3.55. 

48. [The Compositional Formula] Let K be a field of characteristic zero, let f. 
P —>Kbe a function, and let g. N— >Kbe a function satisfying g(0)=l. Define 
h: N-^ifby 

h (n) = ]T)/( l^il)/(l^2|)'"/(|A m |)ff(m), 


where the sum ranges over all partitions (A h A A) of [n] into any 
number of parts. Let F(x) (resp. G(x),H(x)) be the exponential generating 
function of the sequence f(n) (resp. g(n), h(n)). Prove that 


exp (H(x))=G(F(x)). 

49. Find a formula for the number of w-permutations whose third power is the 
identity permutation. 

50. Find a formula for the number of permutations of length n in which each 
cycle length is divisible by k. 

51. Find the exponential generating function A(x) for the numbers a(n) of n- 
permutations that have an even number of cycles of each odd length. 

52. Find the exponential generating function B(x) for the numbers b(n) of 
partitions of [n] that have an even number of blocks of each even size. 

53. Prove combinatorially that ODD(2m)=EVEN(2m). 

54. Find a combinatorial proof for Exercise 50. 
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55. Find a formula for the number of permutations of length n in which no 
cycle length is divisible by k. 

56. Find a bijective proof of the equality ODD(2n) ■ (2n+ 1 )=ODD(2n+ 1). 

57. Find a bijective proof of the equality 

ODD{2n+l)-{2n+l)=ODD{2n+2). 

58. (a) Prove that the number of all fixed point-free zz-permutations with descent 

set {z} is equal to ). 

(b) What can we say about the number of all zz-permutations having exactly 
one fixed point whose descent set is equal to {z}? 

59. Find a bijective proof of the equality SQ(2n)-(2n+l)=SQp2n+l). 

60. Prove, without using generating functions, that SQ(2n+l)-(2n+2)> SQ(2n+2). 

61. Exercise 6 of Chapter 1 provided a probabilistic interpretation for the 
Eulerian numbers. Why cannot we use that interpretation to deduce by 
Levy’s theorem (Theorem 3.37) that the Eulerian polynomials have real 
roots only? Note that the Eulerian polynomials have irrational roots in 
general, while the interpretation of the mentioned exercise would yield 
rational roots. 

62. Find a combinatorial proof for the fact that 

D(n) * 1 2 3 4 =D(n-l)D(n+l), 

if zz>3. Here D(n) denotes the number of derangements of length n. 


Problems Plus 

1. Prove combinatorially that D(n)=nD(n-l)+(-l) n . 

2. (a) Prove that SQp2n+2)<(2n+2)SQp2n+l). 

(b) For what values of n does equality hold? 

3. Let p 2 (n) be the probability that a randomly selected zz-permutation has a 
square root. Exercises 59 and 2 show that p 2 (n) is a decreasing sequence. 
Show that p 2 (n)— >0. 

4. Find a combinatorial proof for Exercise 55. 
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c(n + 1,3) 


Til 

~2 



(b) Show that 

c(n + 1,4) 



6. Let cj>2 be a prime, and let pfn) be the probability that a randomly selected 
w-permutation has a qth root. Prove that p q (n)=p q (n+ 1), except when ra+1 is 
divisible by q. 

7. Show that 

c(n + 1, k 4-1) ~ n![ln(n + 1) + C] k /kl. 


What can we say about the constant C above? 

8. Show that 

S(n,k)c^. 


9. In this problem, we are investigating how many inversions an w-permutation 
can have if it has a fixed number, k, of cycles. Recall that i(p) denotes the 
number of inversions oip, and that c(p) denotes the number of cycles of p. 
Let 


b(n,p) = min{i(p)|p 6 S„, c(p) = k}, 


and 


B(n,p) = max{i(p)|p € S„, c(p) = k}. 

(a) Find a formula for b(n, k). 

(b) Find a formula for B(n, k). 

(c) Find a formula for the number m(n, k) of w-permutations with ^cycles for 
which the minimum is attained, that is, that have b(n, k) inversions. 

(d) Find a formula for the number M(n, k) of w-permutations with A-cycles 
for which the maximum is attained, that is, that have B(n, k) inversions. 

10. An r-derangement is a permutation in which each cycle length is larger than 
r. So a O-derangement is a permutation, while a 1-derangement is a 
derangement. Let 

G n , r {x) = X> C(P) , 

V 
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where the sum is taken over the set Dfn) of all r-derangements of length n. 
Prove that for any fixed n and r, the polynomial G„„(x) has real zeros only. 

11. (a) Let D be a conjugacy class of S„. Prove that the polynomial 

E d (x) = E 

p€D 

where e(p) denotes the number of excedances of p, has real zeros only. 

(b) It follows from part [(a)] that the polynomial Eo(x) = f2 peD x e ^ is 
unimodal. Where does it take its maximal value? 

(c) Now let D kr (n) be the set of all w-permutations that have exactly k 
cycles, and each of these cycles are longer than r. Prove that the 
polynomial 

E D , k , r , n (x) = Y, Xe{P> 

p€i>*, r {n) 

is symmetric and unimodal. 

(d) Prove that the polynomial 

%(n)W= E X€(T) 
peD r {n) 

is symmetric and unimodal. Here D r (n) denotes the set of all r- 
derangements as defined in the previous Problem Plus. 

12. Exercises 41 and 42 defined desarrangements, and discussed some of their 
close connections to derangements. This problem takes that approach 
further. 


Let T £ [n — 1]. Prove the following statement. The number of 
derangements of length n having descent set T is equal to the number of 
desarrangements of length n whose inverse has descent set T. 


13. 


It follows from the solution of Exercise 22 that there exist integers B t i , with 
i £ [k\. such that 


E ak{n)x” 

k =0 


EjU B kpP 

(1 - x ) 2k + 1 


and 


E bk{n) x ‘ 


fe=0 


E<=*+1 B'ik-i+ipP 
(1 - ar ) 2 * +1 


where B k j = (2Ar — 1)!!- Prove that the numbers B kt are always 
nonnegative by finding a set of permutations that these numbers 
enumerate. 


2004 by Chapman & Hall/CRC 




In Many Circles. Permutations as Products of Cycles. 


123 


14. Let S be any proper subset of [w-1]. Prove that the number of fixed point- 
free w-permutations with descent set S is equal to the number of n- 
permutations with one fixed point having descent set S. 

15. Let She any subset of [n- 1]. Prove that there as many involutions on [n] 
with descent set S as involutions on [n] with descent set [wT]-S. 

16. Let p n>k be the number of w-permutations in which every cycle has less 
than k elements. Prove that for any fixed k, the sequence (p nJk )ka. is bg- 
convex, that is, 


Pn,k — Pn-l,kPn+l,k- 

17. Generalize Exercise 41 in the following way. Let F %l be the number of n- 
permutations with exactly k fixed points. Let E„ k be the number of n 
permutations pipz---p n in which p k p 2 ---p k is a increasing seqence, and 
pk+\Pk+ 2 " 'pn is 21 desarrangement. A little thought shows that for each n- 
permutation, there is exactly one nonnegative k with this property. Prove 
that F n , k =E„, k 


Solutions to Problems Plus 

1. This was first proved byJ.Remmel in [164] as a corollary to a more general 
argument involving ^-analogues. Other proofs can be found in [70] and (in 
a slightly different context), in [199]. 

2. (a) The proofs are different for the cases of odd n and even n. The case of 

even n can be proved very similarly to Exercise 59. For odd n, 
however, we cannot solely rely on our previous methods, because 
2w+2 is divisible by 4, and so there are permutations in SQ^.n+2) 
whose odd part is empty. (And therefore, their even part is of length 
2n+2, and as such, cannot be found in shorter permutations). One 
can show, however, that the number of this kind of permutations is 
very small if n> 8. 

(b) Equality holds if and only if n= 1. As part [(a)] shows that 
SQ(2n+l)<SQ(2n+2) for n> 8 , this can be seen by checking the cases of 
n= 1, 3, 5, 7. 

3. A more general version of this result is proved in [30]. 

4. The first combinatorial proof was given in [26]. A slightly simpler proof 
appears in [22]. 
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5. (a) Formula (3.3) is clearly equivalent to 


n 


y; c(n + 1 ,k + l)x k = (x + l)(x + 2) ■ ■ ■ (x + n) 


= n\(l + x) (l + |) •■ (l + f) - 


Then c(n+ 1, 3) is the coefficient of x 2 on the right-hand side. The proof 
then follows by routine computations. 

(b) Similar to part (a). 

6. This result is proved in [30]. 

7. This can be proved like Problem Plus 5, with more complex computations 
involving symmetric functions. See [62] or [56] for those details. Here Cis 
the Euler constant, that is, 



8. We have seen in Lemma 1.16 that k\S(n,k ) = ]T^L 0 (— 1)*(*){& — *)” 
We claim that all summands of the right-hand side are negligibly small 
compared to the last one. That is, we claim that for fixed k and fixed 
i 6 [k], we have that 


(*-»)"(■) 

k n 


as n— >8. This is clearly true as < 2 k and (k-i)/k<. 1. 

9. This line of research was essentially initiated by P.Edelman [78]. 

(a) We have b(n, k)=n-k. On one hand, b(n, k)= n-k is easy to see for 
instance by induction. On the other hand, the permutation (12--W- 
k+l)(n-k+2)(n-k+3)---(n) shows that this is indeed possible. 

(b) We have 



(c) Let us call a cycle ( c 1 c 2 --c m ) unimodal if the sequence of entries c h 

c m is unimodal. Let us call a permutation p unimodal if all its cycles 
(when started with their smallest elements) are unimodal, and when 
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the underlying sets of all cycles of p consist of intervals of positive 
integers. For instance, (1342) (58976) is a unimodal permutation. Then 
it can be proved that b(n, k) is attained by p if and only if p is a unimodal 
permutation. This implies that for \<k<n-l, the number m(n, k) we are 
looking for is given by 


m(n, k) = ^2 
r =0 


-k- 
r — 1 


1\ (k 


-iii—k—r 


(d) If n — 1 > k > [" j"|, then B (n, k) is attained by only one permutation. 
This permutation p is given by 


p{i) 


n + l- i, if i G [1, n — k] U [fe + 1, n] , 
i, if i 6 [n — k + 1, k] 


In other words, p is an involution that leaves the middle of the set 
pi] clcmcnt-wisc fixed. 

If k < \ y], then the numbers M(n, k) we are looking for are given 


M(n,k) = 'jr 


rf i - fc -1\ 
r — 1 J 


C ) 3rsi ’" ry - 


See [78] for further details, proofs and generalizations. 

10. This result is due to Francesco Brenti, and can be found in [47]. 

11. (a) This result (and the rest of the results in this exercise) is due to Francesco 

Brenti, and can be found in [48]. In fact, it is proved in that paper that 


e dW =i n 


Z D . 
1=1 


A d ,-i(x) 
(x - 1 )! ’ 


where d=(di, cp, ■■■, d k ) is the partition defining the conjugancy class D, 
the Aj(x) are the Eulerian polynomials, and finally, 
z D =f[[i>i if 1 (d)m,i (d )!. In this last formula, m,(d) is the multiplicity of 
i as a part of d. 

(b) It is proved in [48] that E D (X) is symmetric. Therefore, it has to take 
its maximal value in the middle. This is then proved to be at (n-m)l 2, 
where m is the number of fixed points in the partition defining D. (If 
this is not an integer, then there are two maximal values, and they are 
bracketing (n-m)l2. In other words, (n-m)l 2 is always the center of symmetry 
of the sequence of coefficients of E D (x).) One main tool of the proof is 
the result of Exercise 24 of Chapter 1 . 
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(c) Let D be a conjugacy class of S„, and let the partition defining D be (dj, 
<a! 2 , d k ), where dfr for i E [Jfc], Then it follows from part [(b)] that 
E d (x) is a symmetric and unimodal polynomial with center of symmetry 
nl 2 as permutations in D have no fixed points. That is, the center of 
symmetry does not depend on D. Our definitions imply 



D 


where D ranges over all conjugacy classes that are given by partitions 
with all parts larger than r. The claim then follows as the right-hand 
side is the sum of symmetric and unimodal polynomials with center 
of symmetry n!2. 

(d) As we have 



part [(c)] implies our claim. 

12. The first proof of this result was given in [72] by the use of symmetric 
functions. Two further, and more bijective, proofs were given in [73], one 
of which is the proof of a special case of a more general theorem. 

13. This result is due to R.Stanley and I.Gessel [108]. Let (f be the set of 
Stirling permutations, that is, the set of all permutations pip 2 —p 2 „ of the 
multiset {1, 1, 2, 2, ■■■, k, K\ in which m<xKh/ and p u =p w imply pv>pu. Then 
the number B ki is equal to the number of permutations pjp 2 •• -p 2 n £ Qk 
that have exactly i descents in the following sense: pj>pj+i or j=2k for 
exaedy i values of j E [2fc].. The above reference contains two proofs of 
this fact, a combinatorial one and an inductive one. 

14. A proof using symmetric functions can be found in [107]. It would still be 
interesting to find a bijective proof. 

15. The first proof is due to V.Strehl [184] who used the Robinson-Schensted 
bijection in his proof. A very short proof using quasi-symmetric functions 
was given in [107], 

16. This follows from the following general result of Bender and Canfield [21]. 
Let Aj, X 2 , be a log-concave sequence of nonnegative real numbers, and 
define the sequences A„ and P„ by 
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Then the A„ are log-concave and the P„ are log-convex. 

In order to get our result from this theorem, set Xj= 1 for j<k, and set Xj= 0 
otherwise. Then the far right-side of the above formula simplifies to exp 
fepn ^f~)i which is precisely the exponential generating funcdon for 
the numbers p n)k . In other words, in this special case, we getp=p n:k . 

17. This result is proved in [72] in two different ways. That paper also contains 
further generalizations, such as refinements of the proved equality with 
respect to descent sets. 
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In Any Way But 'This. Pattern 
Avoidance. ‘The Basics. 


4.1 The notion of Pattern avoidance 

In earlier chapters, we have studied inversions of permutations. These 'were pairs 
of elements that could be anywhere in the permutation, but always related to 
each other the same way, that is, the one on the left was always larger. 

There is a far-fetching generalization of this notion from pairs of entries to k- 
tuples of entries. Consider a “long” permutation, such as /=25641387, and a 
shorter one, say <7=132. We then say that the 3-tuple of entries (2, 6 , 4) in p forms 
a pattern or subsequence of type 132 because the entries (2, 6 , 4) ofj^ relate to each 
other as the entries 132 of q do. That is, the first one is the smallest, the middle 
one is the largest, and the last one is of medium size. The reader is invited to find 
a pattern of type 321 in p. On the other hand, there is no pattern of type 12345 
in p, therefore we will say that p avoids 12345. 

The notion of pattern avoidance is at the center of this entire Chapter, so we 
will make it formal. 


DEFIJVn'IOJV 4.1 Let q — (?1 >• ■ • > <?fc) € Sk be a permutation, and let k<n. We 
say that tlw permutation P = ( Pi,P 2 , " ',Pn) eS n contains q as a pattern if there are k 
entries Pi 1 > Pi 2 > •' *, Pi k in p so that - -<and pi a < Pi b if and only if qPOjb- 

Otherwise we say that p avoids q. 


Example 4.2 

The permutation 3451267 avoids the pattern 321 as it does not contain a 
decreasing subsequence of length 3. It contains the pattern 2134 as shown by the 
entries 4267. [] 

In other words, p contains q as a pattern if p has a subsequence of elements that 
“relate” to one another the same way the elements of q do. 


129 
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4.2 Patterns of length three 

We have enumerated w-permutations with a given number of inversions and 
obtained rather precise results. The corresponding question, that is, a formula 
for the number of w-permutations with a given number of occurrences of a pattern 
q, is in general too difficult. We will therefore first concentrate on the special case 
when this given number is zero. That is, we will try to find the number S n (q) of 
w-permutations that avoid the pattern q. 

Obviously, we have S„(12)=S„(21)=1, so the first nontrivial case is that of 
patterns of length three. There are six such patterns, but as we will shordy see, 
there are many symmetries between them. 

Recall that for the permutation p=pipz...p„, we define the reverse of p as the 
permutation f=p,p n -i .. .pi, and the complement of p as the permutation f whose zth 
entry is n+l-p,. 

It is clear that if a permutation avoids 123, then its reverse avoids 321, thus 
>.S’„(123)=.S’„(321). Similarly, if a permutation avoids 132, then its reverse avoids 
231, its complement avoids 312, and the reverse of its complement avoids 213. 
Therefore we also have S„(132)=S„(231)=S'„(312)=S’„(213). 

If we can prove that 5„(123)=5„(132), then we will have the remarkable 
result that all patterns of length three are avoided by the same number of n- 
permutations. 


LEMMA 4.3 

For all positive integers n, we have S, ! (123)=S'„(132). 

PROOF The are several ways to prove this first nontrivial result of the subject. 
The one we present here is due to R.Simion and F.Schmidt [171]. Recall that an 
entry of a permutation which is smaller than all the entries that precede it is 
called a left-to-right minimum. Note that the left-to-right minima form a decreasing 
subsequence. 

We are going to construct a bijcction f from the set of all 132-avoiding n- 
permutations to the set of all 123-avoiding w-permutations which leaves all left- 
to-right minima fixed. 

The map / is defined as follows. Keep the left-to-right minima of/; fixed, and 
write all the other entries in decreasing order. The obtained permutation/]/) is 
always 123-avoiding as it is the union of two decreasing subsequences, one of 
which is the sequence of all left-to-right minima, and the other is the decreasing 
sequence into which we arranged the remaining entries. 


Example 4.4 

If /M57341258, then the left-to-right minima of p are 6, 3, and 1, therefore 
f(p)= 68371542. □ 
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We would like to point out that the left-to-right minima of p an&f(p) are the 
same, even if some other entries of p have moved. Indeed, we can say that f 
simply rearranges the m entries that are not left-to-right minima pair by pair. 
That is, whenever we (impersonating the function f see a pair of these entries 
that is not in decreasing order, we swap them. This algorithm stops in at most 
(™) steps. Moreover, each step of this algorithm moves a smaller entry to the 
right and a larger one to the left, and therefore never creates a new left-to-right 
minimum. 

We note that this is the only 123-avoiding permutation with the given set and 
position of left-to-right minima. Indeed, if there were two entries x and y that are 
not left-to-right minima and form a 12-pattern, then the left-to-right minimum z 
that is closest to x on the left, and the entries x and y would form an increasing 
sequence. 

Now we prove that/"is a bijection by showing that it has an inverse. Let q be 
an n-permutation that avoids 123. Keep the left-to-right minima of q fixed, and 
fill in the remaining positions with the remaining entries, moving left-to-right, as 
follows. At each step, place the smallest element not yet placed which is larger 
than the closest left-to-right minima on the left of the given position. Call the 
obtained permutation g(q). 


Example 4.5 

If /y=6S371542, then the left-to-right minima of q are 6, 3, and 1. To the empty 
slot between 6 and 3, we put the smallest of the two entries that are larger than 
6 , that is, 7. To the empty slot between 3 and 1, we put the smallest entry not 
used yet that is larger than 3, that is, 4. Immediately on the right of 1, we put the 
smallest entry not used yet that is larger than 1, that is, 2. We finish this way, by 
placing 5 and 8 to the remaining slots, to get g(^=67341258. 

The obtained permutation is always 132-avoiding. Indeed, if there were a 132- 
pattern in it, then there would be one which starts with a left-to-right minimum, 
but that is impossible as entries larger than any given left-to-right minimum are 
written in increasing order. 

Note again that g(q) is the only 132-avoiding permutation that has the same 
set and position of left-to-right minima as q. Indeed, if at any given instance, two 
entries w<i> that are larger than a left-to-right minimum a were in decreasing 
order, then ami would be a 132-pattern. 

This proves that g(f(p))=p, implying that f is a bijection, and proving our 
theorem. 

The techniques and notions used in the above proof will be used so often in 
the coming sections that it is worth visualizing them. Figure 4.1 shows our 
running example, the permutation p= 67341258, and its image, f(p)= 
68371542. 
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FIGURE 4.1 

A 132-avoiding permutation and its image. 

Note again that the left-to-right minima (the decreasing sequence of gray circles) 
are left fixed. In f(p), all the remaining entries form a decreasing sequence, while 
in p, all entries larger than a given left-to-right minimum form an increasing 
sequence. 

Now that we know that S„(q) does not depend on q as long as the length of q 
is three, it suffices to determine S n (q) for one possible choice of q of length three. 


THEOREM 4.6 

For all positive integers n, we have 


( 2n ) 

Sn 132 = C n = 

n + 1 

PROOF Set c,=SJl3‘2). Suppose we have a 132-avoiding w-permutation in which 
the entry n is in the zth position. Then any entry to the left of n must be larger 
than any entry to the right of n. Indeed, if x andy violate this condition, then xny 
is a 132-pattern. Therefore, the set of entries on the left of n must be (w-z+1, n- 
z+2, ..., n- 1}, and the set of entries on the right of n must be [n-i]. Moreover, 
there are c H possibilities for the order of entries to the left of n and c n _ t possibilities 
for the order of entries on the right of n. Summing for all i we get the following 
recursion: 

n—1 

Cn — ^ 1 Cj—lCn—i- (4.1) 

i=0 

Therefore, if C(x) = ffff-Q c n x n is the the ordinary generating function of the 
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c n , then (4.1) implies C?(x)x+ l=C(x), which yields 


C(x) 


1 — s/1 — 4.x 
2x 


(4.2) 


By standard methods this yields 


C{x) = 


n—1 


1 (2n - 2 
n—1 


„7l— 1 



and the statement of the theorem is proved. 


■ 


(4.3) 


Because of the trivial identities S„(132)=S„(231)=S„(312)=S„(213). and 
S„(123)=S„(321), we have proved the following. 


COROLLARY 4.7 

Let q be a pattern of length three. Then we have 


Sn{q) = c n 


n + 1* 


The numbers C„ are called the Catalan numbers. We will shordy see that the 
condition that q is of length three is of crucial importance. 


4.3 Monotone Patterns 

After obtaining satisfying results for the case of patterns of length three, the 
reader may think that we will now turn to patterns of length four, and then to the 
general case, and obtain similarly strong results. Unfortunately, this is quite a 
mountainous task. As we will shortly see, finding an exact formula is difficult 
even for patterns of length four, and is an open problem for all longer patterns. 
There has been, however, a long-standing conjecture (recently proved) that 
connected all patterns by claiming that no matter what q is, the number of re- 
permutations avoiding q is very small compared to re!. 

CONJECTURE 4.8 [Stanley-Wilf conjecture, 1980] Let q be any pattern. Then there 
exists a constant c q so that far all positive integers, we have 

Sn(q ) < c* (4.4) 

Note that the conjectured number c] is very small compared to the number of 
all re-permutations, which is re!. In other words, this is a quite ambitious conjecture. 
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The following conjecture may look even more ambitious, but that is a false 
appearance. 


COJVJECTURE 4.9 [Stanley-Wif conjecture, alternative version] Let q be any pattern. 
Then the limit 


lint y/Sn(q) 

n —»oo 

exists. 


The first published proof of the fact that the above two versions of the conjecture 
are equivalent is given in [8]. On one hand, it is obvious that Conjecture 4.9 
implies Conjecture 4.8. On the other hand, to prove the implication in the other 
direction, we claim that S n (q)S m (q)^S n+m (q), for all patterns q, and all positive 
integers n and m. Indeed, we can assume by symmetry that q is a pattern in 
which the maximal element k precedes the minimal element 1. Now let pi and p 2 
be permutations of length n and m, respectively, that avoid q. Consider the 
concatenation of pi and p 2 , and add m to each entry of p\. This results in a 
permutation P E Sn+m that clearly avoids q, and therefore injectively proves 
that S n (q)S m (q)^S n+m (q). Therefore the sequence \/S n (q) is bounded and 
monotone, and must thus be convergent. 

After almost a quarter century, the Stanley-Wilf conjecture has recendy been 
proved by a spectacular argument [149]. We will present that proof in Section 
4.5. For now, we will look at some special cases in which more precise results are 
available. 

We have seen in the previous section that the Stanley-Wilf conjecture is true if 
q is of length three, with c= 4. Indeed, C n < ( :! f) < 4". There is only one 
other class of patterns for which it is similarly easy to prove that Stanley-Wilf 
conjecture, namely monotone patterns. 

THEOREM 4.10 

For all positive integers n and k>2, we have 

S,( 123-£)<(M) 2 ” 

PROOF Let us say that an entry x of a permutation is of rank i if it is the end of 
an increasing subsequence of length i but there is no increasing subsequence of 
length i+1 that ends in x. Then for all i elements of rank i must form a decreasing 
subsequence. Therefore., a ^-avoiding permutation can be decomposed into the 
union of k-1 decreasing subsequences. Clearly, there are at most (k-Y) n ways to 
partition our n entries into k-\ blocks. Then we have to place these blocks of 
entries somewhere in our permutation. There are at most (k-Y)" ways to assign 
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each position of the permutation to one of these blocks, completing the 
proof. 


Note that for k=3, we get that 5'„(123)<4" ) agreeing with our earlier results. As 
we have seen that S„(123)=C % we see that the constant 4 provided by Theorem 
4.10 is actually the best possible constant. This is not an accident. 

Indeed, Theorem 4.10 has a stronger version, which needs heavy analytic 
machinery, and therefore will not be proved here. We mention the result, however, 
as it shows that no matter what k is, the constant ( k-\ ) 2 cannot be replaced by a 
smaller number. We remind the reader that functions f(n) and g(n) are said to be 
asymptotically equal if lim^-, = 1. 


THEOREM 4.11 

[161] For all n, 5„(1234--T) asymptotically equals 


Here 


(Ar-1 )*> 
* — 2£)/2 


A* = 7 if I “ J [J3(sfi,*2,"*i Xk)-e (k/2}x2 J 2 dx 1 dx 2 ---dx k , 

11>X2> --- > Xk 

where D(xi ,jr 2 ,---,x t ) II j<j (X* - x$), and = (1/VS)* -1 - fe*- 2 / 2 . 
Note that the multiple integral in the above formula evaluates to a constant. 


4.4 Patterns of length four 

In this section we will study S n (q) for patterns of length four. There are 24 patterns 
of length four, but there are many (trivial and nontrivial) equivalences between 
them. First, we will use these equivalences to significantly decrease the number 
of patterns that need individual attention. 

By taking reverses and complements, we can restrict, our attention to those 
patterns of length four in which 


(a) the first element is smaller than the last one and 

(b) the first element is 1 or 2 . 

Thi s still leaves us 11 patterns, 1234, 1243, 1324, 1342, 1423, 2134, 2143, 
2314, 2341 and 2413. Note that if p contains q, then the inverse of p clearly 
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contains the inverse of q (as the inverse of a permutation matrix is its transpose), 
so S„(q)=S n (cf'). Therefore, we can drop 1423, too, as its inverse 1342 remains 
on the list. Similarly, we can drop 2314 as its complement is 3241 and the reverse 
of that is again 1423. 

We cannot proceed further without new tools. The following general theorem 
of Backelin, West, and Xin has a quite involved proof, but some of its special 
cases are easier. 

THEOREM 4.12 

[16] Let k be any positive integer, and let q be a permutation of the set { k+ 1, k+2 , ■ • •, k+ r). 
Thenfor all positive integers n, we have 


£(123 • • ■ k])=S n (k(k-l) • • -1 ? ). 


For instance, if r= 2, k=2 and <7=34, then Theorem 4.12 says that 
£(1234)=£(2134) . If <p=43 and the other parameters do not change, then 
Theorem 4.12 says that £,(1243)=£(2143). As 2134 and 1243 are reverse 
complements of each other, the patterns 2134,2143, and 2134 can all be removed 
from our list. Now let r= 3, and k=l, then clearly < 7 = 4 , and Theorem 4.12 shows 
that £(1234)=£(3214). Therefore, by taking complements, we can remove 2341, 
and its reverse 1432, from our list. 

This leaves us with the patterns 1234, 1324, 1342, and 2413. The following 
result of Stankova eliminates one more pattern. 

LEMMA 4.13 

[174] For all positive integers n, we have 

£(1342)=£(2413). 

We note that Stankova proved her result in the equivalent form of £,(4132)= 
£(3142). 

So we are left with three patterns, 1234, 1342, and 1324. It is high time that 
we took a look at numerical evidence computed byJ.West in [196]. This evidence 
shows that as n grows starting at n=l, 

• for £(1342) we have 1, 2, 6, 23, 103, 512, 2740, 15485 

• for £(1234) we have 1, 2, 6, 23, 103, 513, 2761, 15767 

• for £(1324) we have 1, 2, 6, 23, 103, 513, 2762, 15793. 

These data are startling for at least two reasons. First, the numbers £ (q) are no 

longer independent of q. That is, there are some patterns of length four that are 
easier to avoid than others. Second, the monotone pattern 1234 does not provide 
either extremity. That is, it seems that the monotone pattern is neither the easiest 
nor the hardest one to avoid. 
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The numerical evidence shown above also raises some intriguing 
questions. 


QUESTION4.14 Is it true thatfor all w>7, we have 5„(1234)<5„(1324)? 


QUESTION 4.15 Is it true thatfor all 6, we have 5', 1 (1342)<6', ! (1234)? 


QUESTION 4.16 In general, what makes a pattern easier to avoid than another 
pattern? 


QUESTION 4.17 Is it true that if S„(qi)KS n (q^) for some n, then for all N>n, we have 

In this section, we will answer questions 4.14 and 4.15 in the affirmative. There 
is no known answer for question 4.16, but we will mention some related interesting 
facts. The answer for question 4.17 is, in general, negative. The first 
counterexample [175] is for two patterns q\ and q 2 of size five, and for w=12. So 
whether a pattern is easy or hard to avoid depends in some cases not just on the 
pattern itself, but also on n. 


4.4.1 The Pattern 1324 

The following result is the earliest example [40] in which a pattern was shown to 
be more restrictive than another pattern of the same size. 


THEOREM 4.18 

For all ii>l , we have 


5„(1234)<5„(1324). 

The first step in our proof is the following classification of all w-permutations. 
DEFINITION 4.19 Two permutations x andy are said to be in the same class f 

• the lef-to-right minima of x are tlw same as those ofy, and 

• the left-to-right minima of x are in the same positions as the left-to-right minima ofy, 
and 

• the above two conditions hold for the right-to-left maxima of x andy as well. 
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Example 4.20 

Permutations 3=51234 and y=5 1324 are in the same class, but 2=24315 and 
u=24135 are not, as the third entry of z is not a left-to-right minimum whereas 
that of v is. 

The outline of our proof is going to be as follows: we show that each nonempty 
class contains exactly one 1234-avoiding permutation and at least one 1324-avoiding 
permutation. Then we show that “at least one” means “more than one” at least 
once, completing the proof. 

The first half of our argument is simple. 


LEMMA 4.21 

Each nonempty class contains exactly one 1234-avoiding permutation. 


PROOF Suppose we have chosen a class C, that is, we fixed the positions and 
values of all the left-to-right minima and right-to-left maxima. It is clear that if 
we put all the remaining entries into the remaining slots in decreasing order, 
then we get a 1234-avoiding permutation. Indeed, the permutation obtained 
this way consists of 3 decreasing subsequences, that is, the left-to-right minima, 
the right-to-left maxima, and the remaining entries. Thus, if the permutation 
just constructed contained a 1234-pattern, then by the pigeon-hole principle 
two of the entries of that 1234-pattern would be in the same decreasing 
subsequence, which would be a contradiction. Note that if C is nonempty, then 
we can indeed write the remaining entries in decreasing order without conflicting 
with the existing constraints—otherwise C would be empty. This can be seen as 
in the proof of Lemma 4.3. Therefore, C contains at least one 1234-avoiding 
permutation. 

On the other hand, we claim that the decreasing order of the remaining entries 
is the only one that will result in a 1234-avoiding permutation. Indeed, if we put 
two of the remaining entries, say a and b, in increasing order, then together with 
the rightmost left-to-right minimum on the left of a and the leftmost right-to-left 
maximum on the right of b they would form a 1234-pattern. 

The second half of our argument is somewhat subtler. 


LEMMA 4.22 

Each nonempty class contains at least one 1324-avoiding permutation. 


PROOF First note that if a permutation contains a 1324-pattern, then we can 
choose such a pattern so that its first element is a left-to-right minimum and its 
last element is a right-to-left maximum. Indeed, we can just take any existing 
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pattern and replace its first (last) element by its closest left (right) neighbor which 
is a left-to-right minimum (right-to-left maximum). Therefore, to show that a 
permutation avoids 1324, it is sufficient to show that it does not contain a 1324- 
pattem having a left-to-right minimum for its first element and a right-to-left 
maximum for its last element. Such a pattern will be called a bad pattern. Also 
note that a left-to-right minimum (right-to-left maximum) can only be the first 
(last) element of a 1324-pattern. 

Now take any 1324-containing permutation. By the above argument, it has a 
bad pattern. Interchange its second and third element. Observe that we can do 
this without violating the existing constraints, that is, no element x goes on the 
left of a left-to-right minimum that is larger than x, and no element y goes on the 
right of a right-to-left maximum that is smaller thany. The resulting permutation 
is in the same class as the original because the left-to-right minima and right-to- 
left maxima have not been changed. Repeat this procedure as many times as 
possible, that is, as long as 1324-patterns can be found. Note that crucially, each 
step of the procedure decreases the number of inversions of our permutation by at least one. 
Therefore, we will have to stop after at most (”) steps. Then the resulting 
permutation will be in the same class as the original one, but it will have no bad 
pattern and therefore no 1324-pattern, as we claimed. 


We are only one step away from proving Theorem 4.18. We have to show that if 
w>7, then classes that contain more than one 1324-avoiding permutations indeed 
exist. Let n=l. then the class 3* 1*7*5 (with the stars denoting the positions of the 
remaining entries) contains two such permutations, namely 3612745 and 3416725. 
If w>7, then put the entries n, n- 1, ..., 8 in front of 3*1*7*5, with no additional 
stars. The obtained class will again contain two 1324-avoiding permutations, 
coming from the subwords 624 and 462 of the remaining entries. This completes 
the proof of Theorem 4.18. 

At this point readers with a penchant for asymptotic results will surely say 
something like “OK, S„(1324) is larger than S„(1234), but is it really significandy 
larger?” That is, could it be that S, ,(1234) and 5„(1324) are in fact asymptotically 
equal, meaning that (5„(1234)/5„(1324))—>1? The following Proposition answers 
this question in the negative. 


PROPOSITION 4.23 

The sequences <S„(1234) and <S„(1324) are not asymptotically equal. 


PROOF See the solution of Exercise 29. 

There is a weaker notion of two sequences growing roughly at the same 
rate, namely that of logarithmic asymptotics. We say that two sequences f(n) 
and g(n) are logarithmic asymptotically equal if f(f(n) / g(n)) —» 1 . It is not known 
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whether .S’„(1234) and -S’„( 1324) are logarithmic asymptotically equal or not. 
It follows from Theorem 4.11 that f S n {\2?A) -4 9, so one would “only” 
need to decide whether lim l30 tfSff 1324) > 9 or not in order to solve this 
open problem. 


4.4.1.1 An exponential upper bound 

We have seen that S„ (1324)>.S’„(1234) for n>7. Therefore, we cannot infer the 
Stanley-Wilf conjecture for 1324 from Theorem 4.10; we need a new proof. 
While the Stanley-Wilf conjecture has recendy been proved for all patterns, we 
will see in Section 4.6 that there is significant room for improvement as far as the 
constant c goes in the exponential upper bound c“. For this reason, and for reasons 
that we will explain at the end of this subsection and in Section 4.6, it is still 
worthwhile to look for good upper bounds on .S’„(1324), and for new techniques 
to prove upper bounds on S n (q). 

The following simple definition will be unexpectedly helpful. 


DEFINITION4.24 We will say that an n-permutation p=p\pr • -p„ is orderly ifp\<p n . 
We will say that p is dual orderly if the entry 1 ofp precedes t!w maximal entry n ofp. 

It is clear that p is orderly if and only if p~ l is dual orderly. 

The importance of these permutations for us is explained by the following lemma. 


LEMMA 4.25 

The number of orderly (resp. dual orderly) 1324-avoiding n-permutations is less than 
8 " 

4(n+l)' 


PROOF It suffices to prove the statement for orderly permutations as we can 
take inverses after that to get the other statement. 

The crucial idea is this. Each entry pi of/; has at least one of the following two 
properties. 

(a) f>pu 

(b) pi<p n ; 

Define S={ if^pp and T={i\p,<pt) . Then S and T are disjoint, S U T = [n], and, 
crucially, if?' £ T, then in particular /?,</>„. Let |S|=j and | T\=t. Then we have C,-i 
possibilities for the substring ps of entries belonging to indices in S, and C=C M 
possibilities for the substring pr of entries belonging to indices in S. Indeed, ps 
starts with its smallest entry, and then the rest of it must avoid 213, (otherwise, 
together with/?i, a 1324-pattern is formed) and must avoid 132 (otherwise, 
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together w\xh.p n , a 1324-pattern is formed). Finally, we have choices for the 
set of indices that we denoted by S. Once s is known, we have no liberty in 
choosing the entriesp„ (i £ S) as they must simply be the s largest entries. 
Therefore, the total number of possibilities is 


n 


E 


C s —l^n—s 


< 2 n ~ 2 Y, Cs-iC n - s < 2 n ~ 2 C n < 

s =2 


8 " 

4(n + 1) 


We have seen that it helps in our efforts to limit the number of 1324-avoiding 
permutations if a large element is preceded by a small one. To make good use of 
this observation, look at all non-inversions of a generic permutation p=p\pr- -p„; 
that is, pairs so that i<j and ppfij. Find the noninversion (i,j) for which 


max(j - i,pj -pi) ( 4 5 ) 

(*.j) ' ' 

is maximal. If there are several such pairs, take one of them, say the one that is 
lexicographically first. Call this pair (i,j) the critical pair of p. 


Example 4.26 

Let/>=5716243. Then the critical pair of p is (3,4), as p 3 = 1 and pn=f>, so pjpj= 5- 
There is no other non-inversion for which j-i or pjpi would be so high. Q 


Example 4.27 

If the w-permutation p is orderly, then its critical pair is (1, n). 

The following proposition is obvious, but it will be important in what follows, so 
we explicitly state it. 


PROPOSITION 4.28 

For any permutation, the critical pair is always a pair in which thefirst entry is a lefi-to-right 
minimum, and the second entry is a right-to-lefi maximum. 

We will now prove that each class contains at most an exponential number (that 
is, d) of 1324-avoiding w-permutations. 

Choose a class A. By Proposition 4.28, we can see that the critical pair of any 
permutation p £ A is the same as it depends only on the left-to-right minima 
and the right-to-left maxima, and those are the same for all permutations in A. 

We will now find an upper bound for the number of 1324-avoiding n- 
permutations in A. 
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FIGURE 4.2 

A generic 1324-avoiding permutation. 


For symmetry reasons, we can assume that in the critical pair of p £ A, we 
have j-i—pffv i n other words, the maximum (4.5) is attained by j-i. 

We will now reconstruct^ from its critical pair. First, all entries that precede p, 
must be larger than p r Indeed, if there existed k<ii so that pif^pj, then the pair (j, 
k) would be a “longer” non-inversion than the pair (i,j), contradicting the critical 
property of (i, j). Similarly, all entries that are on the right of pj must be smaller 
than p { . 

This shows that all entries p, for which pi<p,<pj must be positioned between p t 
and pj, that is, i < CKj must hold for them. However, if j-i=pj-pi+b, where b is a 
positive integer, then we can select b additional entries that will be located between 
pi and pj. We will call them excess entries, that is, an excess entry is an entry p u that 
is located between p t and pj, but does not satisfy p<tp u <tpj. 

Figure 4.2 shows the diagram of a generic 1324-avoiding permutation. 

The good news is that we do not have too many choices for the excess entries. No 
excess entry can be smaller than p t -b. Indeed, if we had p,<p>,-b for an excess entry, 
then for the pair (u,j) the value defined by (4.5) would be larger than for the pair (i, 
j), contradicting the critical property of (i,j). This is because we would have pjpf>pj- 
(p I -b)=pjp,- s rb=j-i. By an analogous argument, no excess entry can be larger than pj+b. 
Therefore, the set of b excess entries must be a subset of the at-most-(2/;)-clement set 

{{Pi ~ b,Pi - b + 1, • • ■ ,pi - 1} U {pj + l,Pj + 2, • • • ,pj + b}) n [n]. There¬ 
fore, we have at most choices for the set of excess entries, and consequently, we 
have choices for the set of j-i-l+b elements that are located between p, and pj. As 

pf^pj, the partial permutation pp l+ \' • -p, is orderly, and certainly 1324-avoiding. 
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Therefore, by Lemma 4.25, we have less than 8i i+1 /4(/ : z+l) choices for it once the set 
of entries has been chosen. 

This proves that altogether, we have less than 


4 b ■ 


4 (j -i + 1) 


< 32 j ~ l 


possibilities for the string pp M ■ ■ -p r We used the fact that bfj-i-1 as b counts 
the excess entries between i and j. Note that we have some room to spare 
here, so we can say that the above upper bound remains valid even if we 
include the permutations in which the maximum was attained by (p it pj, and 
not by (i,j). 

We can now remove the entries p M ■ ■ -p r \ from our permutations. This will 
split our permutations into two parts, p L on the left, and p R on the right. It is 
possible that one of them is empty. We know exactly what entries belong to p L 
and what entries belong to p R , indeed each entry of p L is larger than each entry of 
p R . Therefore, we do not lose any information if we relabel the entries in each of 
pi and p R so that they both start at 1. This will not change the location and 
relative value of the left-to-right minima and right-to-left maxima either. 

Then we iterate our procedure. That is, we find the critical pairs of p L and pR, 
denote them by (iifi) and (inff, and prove, just as above, that there are at most 
32^ - u. possibilities for the string between i L and ji, and there are at most 32 JH <H 
possibilities for the string between i R and j R . Then we remove these strings again, 
cutting our permutations into four parts, and so on. 

Iterating this algorithm until all entries of p that are not left-to-right minima or 
right-to-left maxima are removed, we prove the following. 


LEMMA 4.29 

The number of1324-avoiding nfermutations in any given class A is at most 32". 


PROOF The above description of the removal of entries by our method shows 
that the total number of 1324-avoiding permutations in A is less than 

32 IN ) 


where the summation ranges through all intervals (i k , j h ) whose endpoints are 
critical pairs at some point. As the interiors of these intervals are all disjoint, 
f^ k {jk — if) < Tl — 1, and our claim is proved. | 

Now proving the upper bound for *S’„( 1324) is a breeze. 
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THEOREM 4.30 

There exists an absolute constant c so thatfor all n, we have 5„(1324)<c". 

PROOF As there are less than 9” classes and less than 32” w-permutations in 
each class that avoid 1324, <=9-32=288 will do. | 


We point out that this is certainly not the best upper bound for <S„(1324). With 
less elegant arguments, the upper bound can be decreased. Nevertheless, the 
conjecture that 6’„(1324)<9" is open. It would be interesting to decide this 
conjecture in either direction. As of now, the smallest constant that has a chance 
to play the role of c q in the inequality S n (q) < c" is c q = (k — l) 2 , where k is 
the length of q. We know from Theorem 4.11 that no smaller constant will do. A 
disproof of the conjecture that 6’„(1324)<9" would show that sometimes a larger 
constant is needed. 

Numerical evidence suggests that for any given k the value of S„(q) is maximized 
by the pattern 1325476 - ■ - . The results of this section prove that this is indeed the 
case for k=4. If we could show that this is true for all pattern lengths k, then an 
upper bound given for S„( 1324576 ■ • •) would be an upper bound for all patterns 
of length k. Exercise 32 and 31 sketch a way to prove an upper bound for 
S„( 1324576- • •), so we would “only” need to show that there is indeed no pattern 
of length k that is easier to avoid than 1325476---. 


4.4.2 The Pattern 1342 

In this subsection, we turn our attention to the pattern 1342. Interestingly, we 
will be able to provide an exactformula for S, ,(1342). This is exceptional; the only 
other pattern longer than three for which an exact formula is known is 1234. 
The formula is given by the following theorem. 

THEOREM 4.31 

For all positive integers n, we have 


S„(1342) 


f _ n n-i (7n 2 - 3 n - 2) 
1 ' 2 


n 

+ 3^(-l) n ^-2 i+1 - 

i=2 


(2 i - 4)1 
*!(* — 2 )! 



This is a very surprising result. It is straightfo rward to p rove from this formula 
that S„(1342)<8“ for all n, and that limn-yoo \J S„(1342) = 8. 

The result itself is not the only interesting aspect of the facts surrounding 
the pattern 1342. We will see that permutations avoiding 1342 are in bijection 
with two different kinds of objects which at first look totally unrelated. The 
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FIGURE 4.3 

A (3(0, l)-tree. 


first, and for our purposes, more important, type of objects is a specific kind of 
labeled trees. 

DEFINITION 4.32 [64] A rooted plane tree with non-negative integer labels l(v) on each 
of its vertices v is called a y8(0, 1 [tree if it satisfies thefollowing conditions: 

• fv is a leaf tlwn l(v)= 0, (this explains the 0 in the name of yS(0, lftrees), 

• fv is the root and v h v 2 , •••, v k are its children, then If) — l(vf)j 

• f v is an internal node and v u v 2 , •••, v k are its children, then If) < 
1 + y^_, Ifk) (this explains the 1 in the name of /3(0, lftrees). 

Example 4.33 

Figure 4.3 shows a (3(0, l)-tree on 12 vertices. 0 

Let us call a permutation p=pf 2 ■ ■ -p„ indecomposable if there exists no k £ (n — 1] 
so that for all i<k<f, we have pf>p r In other words, p is indecomposable if it 
cannot be cut into two parts so that everything before the cut is larger than 
everything after the cut. For instance, 3142 is indecomposable, but 43512 is 
not as we could choose k=3, that is, we could cut between the third and 
fourth entries. If a permutation is not indecomposable, then we will call it 
decomposable. 

The importance of (3(0, l)-trees for us is explained by the following theorem. 
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FIGURE 4.4 

The three y0(O, 1)-trees on three vertices. 


THEOREM 4.34 

For all positive integers n, there is a bijection Ffrom the set findecomposable 1342-avoiding 
n-permutations to the set j}0(O,l) of /f(0, 1 )-trees on n vertices. 


Example 4.35 

Let n= 3. Then there are three indecomposable n-permutations, 123, 132, and 
213, and they all avoid 1342. Correspondingly, there are three [:>((). l)-trees on 
three vertices, as can be seen in Figure 4.4. 

Let t n = | Dn*' 0 ’ 1 ' 1 1- If we can prove Theorem 4.34, then we have made a crucial 
step in advance as it is known [64] that 


in—2 (2 n - 2)1 

(n + l)!(n — 1)! 


(4.6) 


We start by treating two special types of /3(0, l)-trees on n vertices. There are two 
things that contribute to the structure of a y0(O, l)-tree, namely its (unlabeled) 
tree structure, and its labels. We will therefore first look at /3(0, 1)- trees in which 
one of these two ingredients is trivial, that is, /f(0, l)-trees that consist of a single 
path only, and /3(0, l)-trees in which all labels are zero. 


LEMMA 4.36 

There is a bijection ffrom tlw set of1342-avoiding n-permutations starting with the entry 1 
and tlw set f y8(0, l)-trees on n vertices consisting of one single path. 

Note that a simpler description of the domain of / is that it is the set of 231- 
avoiding permutations of the set (2, 3, 4, •••, n). 
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FIGURE 4.5 

The P( 0, l)-tree of^143265. 

PROOF Let p=p\p 2 '' • p n be an 1342-avoiding zz-permutation so that ^=1. Take 
an unlabeled tree on n nodes consisting of a single path and give the label l(i) to 
its zth node (l<z<n-l) by the following rule: 



\ {j < i so that pj > p s for at least one s > i,}\ if i < n 


l(n — 1) if i = n. 


That is, l(i) is the number of entries weakly on the left of p, which are larger 
than at least one entry on the right of p { . Note that this way we could define f 
on the set of all zz-permutations starting with the entry 1, but in that case, f 
would not be a bijection. (For example, the images of 1342 and 1432 would be 
identical.) 

Example 4.37 

If/z=143265, then the labels of the nodes of f(p) are, from the leaf to the root, 0, 
1, 2, 0, 1, 1. See Figure 4.5. For easy reference, we wrote p, to the zth node of the 
path f(p). To avoid confusion, in this Figure, and for the rest of this subsection, 
the entries of p will be written in small, Roman letters, and the labels of the 
nodes will be written in large italic letters. Q 

It is easy to see that ^indeed maps into the set of /f(0, l)-trees: /(z+l)</(z)+l 
for all i because there can be at most one entry counted by /(z+1) and not 
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counted by l(i), namely the entry p i+l . All labels are certainly nonnegative and 

/( 1 )= 0 . 

To prove that / is a bijection, it suffices to show that it has an inverse, that is, 
for any y0(O, l)-tree T consisting of a single path, we can find the only permutation 
p so that f(p)=T. We claim that given % we can recover the entry n of the 
preimage p. First note that p is 1342-avoiding and starts by 1, so any entry on 
the left of n must be smaller than any entry on the right of n. In particular, the 
node preceding n must have label 0. Moreover, as n is larger than any entry 
following it in p, the entry n is the leftmost entry p, of p so that l(fj >0 for all j>i 
if there is such an entry at all, and n=p n if there is none. That is, n corresponds 
to the node which starts the uninterrupted sequence of stricdy positive labels 
that ends in the last node as long as there is such a sequence. Otherwise, n 
corresponds to the last node. 

Once we located where n is in p, we can simply delete the node corresponding 
to it from 7 and decrement all labels after it by 1. (If this means deleting the last 
node, we just change l(n- 1) so it is equal to l{n- 2) to satisfy the root-condition.) We 
can indeed do this because the node preceding n had label 0 and the node after 
n had a positive label (1 or 2), by our algorithm to locate n. Then we can proceed 
recursively, by finding the position of the entries n- 1, n- 2, 1 in p. This clearly 

defines the inverse of f so we have proved that /"is a bijection. 

As we promised, we continue by explaining which indecomposable 1342- 
avoiding permutations correspond to 0, l)-trees in which all labels are equal 
to zero. 


LEMMA 4.38 

There is a bijection gfrom the set of 132-avoiding n-permutations ending with n to the set of 
P(0, 1 )-trees on n vertices with all labels equal to zero. 

Note that we could describe the domain of g as the set of indecomposable 132- 
avoiding w-permutations, or as the set of (n-i )-permutations that avoid 132. 


PROOF In this proof, we can obviously think of our /f(0, l)-trees as unlabeled 
rooted plane trees. A branch of a rooted tree is a subtree whose root is one of the 
root’s children. Some rooted trees may have only one branch, which does not 
necessarily mean they consist of a single path. 

We will construct ^inductively. There is only one unlabeled /3(0, l)-tree no 2 
vertices and it is the image of the only 1-permutation; />= 1. Using induction, 
suppose we have already constructed g for all positive integers k<n. Let p be a 
132-avoiding permutation of length n. Let j> =pfy■ -p„s- Then there are two 
possibilities. 
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(a) The first case is when p is decomposable, that is, we can cut p into several 
(at least two) strings p < i>, p< 2 >, • ■ •, p<n> so that all entries of p<x> are larger 
than all entries of p<j> if z</. In this case, g(p) will have h branches, the 
branch b; satisfying g(p<T)=b,. We then obtain g(p) by connecting all branches 
bi to a common root. Given that we are in a /3(0, l)-tree, the label of the 
root is determined by the labels of its children. 

(b) The second case is when pi is indecomposable. As p avoids 132, this is 
equivalent to saying that p ends with its maximal entry n- 1. In this case, 
g(p) will have just one branch b h that is, the root of g(p) will have only one 
child. We define b x =g(p). 

Again, we prove that gis a bijection by showing that it has an inverse. Let The an 
unlabeled plane tree on n vertices with root q. Let q have ^children, and say that, 
going left-to-right, they are roots of the branches b u b 2 , b„ which have n h 
W 2 , ■■■,«, nodes. Then by induction, for each i, the branch b, corresponds to a 132- 

avoiding w-permutation ending with n,. Now add 'T,,. _ fij to all entries of the 

permutation p, associated with b t then concatenate all these strings and add n to 
the end to get the permutation p associated with T. 

It is straightforward to check that this procedure always returns the original 
permutation, proving our claim. 


Example 4.39 

The permutation 45631278 corresponds to the /3(0, l)-tree with all labels 
equal to 0 shown in Figure 4.6. For easy reference, we write p n to the root of 
g(p) and proceeded analogously for the other entries in the recursively defined 
subtrees. 

□ 

An easy way to read off the corresponding permutation once we have its entries 
written to the corresponding nodes is the well-known postorder reading: for every 
node, first write down the entries associated with its children from left to right, 
then the entry associated with the node itself, and do this recursively for all the 
children of the node. 

Our plan is to bring Lemmas 4.36 and 4.38 together to prove Theorem 4.34. 
This needs some preparation. Optimally, we would take a 1342-avoiding 
indecomposable «-pcrmutaiion/), associate its entries to the nodes of an unlabeled 
plane tree T, then define the labels of this tree so that it becomes a [:>((), l)-tree. 
The question is, however, how do we know what T we should use, and if T is 
given, in what order we should write the entries otp to the nodes of T. In what 
follows, we develop the notions to decide these questions. 
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FIGURE 4.6 

The p{ 0, l)-tree of>=45631278. 


DEFINITION 4.40 Two n-permutations x and y are said to be in the same weak 
class if the left-to-right minima of x are the same as those of y, and they are in the same 
positions. 

Example 4.41 

Permutations 456312 and 465312 are in the same weak class since their left-to- 
right minima are 4, 3 and 1, and they are located at the same positions. 
Permutations 31524 and 34152 are not in the same weak class. P 

PROPOSITION 4.42 

Each nonempty weak class C of n-permutations contains exactly one 132-avoiding 
permutation. 

PROOF Take all entries which are not left-to-right minima and fill all empty 
positions between the left-to-right minima with them as follows: in each step 
place the smallest element which has not been placed yet which is larger than the 
previous left-to-right minimum. (This is just what we did in the proof of the 
Simion-Schmidt bijection in Lemma 4.3.) 

On the other hand, the resulting permutation will be the only 132-avoiding 
permutation in this weak class because any time we deviate from this procedure, 
(that is, we place something else, not the smallest such entry) we create a 132- 
pattern. | 

DEFINITION 4.43 The normalization N(p) of an nfermutation p is the only 132- 
auoiding permutation in the weak class C containing p. 
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Example 4.44 

E^356214, then N(p)= 345216. □ 


DEFINITION 4.45 The normalization N(T) of a 0,1 )-tree T is the /3(0, 1 )-tree which 
is isomorphic to T as a plane tree, with all labels equal to zero. 

It turns out that normalization preserves the indecomposable property. 


PROPOSITION 4.46 

A permutation p is indecomposable f and only fN(p) is indecomposable. 


PROOF (The author is grateful to Aaron Robertson, who found a corrected a 
mistake in his original argument.) We will show that whether p is decomposable 
or not depends only on the set and position of its left-to-right minima, which is 
obviously equivalent to the claim to be proved. Let Cbe the weak class containing 
p, given by the set and position of its left-to-right minima. It is clear that ifp G C 
is decomposable, then the only way to cut it into two parts (so that everything 
before the cut is larger than everything after the cut) is to cut it immediately on 
the left of a left-to-right minimum a<n. Now if there is a left-to-right minimum in 
position n-a+2, then the entries 1, 2, •••, a-l must occupy positions n-a+2, n- 
a+ 1,—, n. Therefore, we can cut immediately on the left of position n-a+2, and p 
is decomposable. 

If there is no such a, then for all left-to-right minima m, all the entries 1,2, • • •, 
m- 1 must be to the right of m. However, in one of the positions n-m+2, n-m+1, ■■■, 
n, there exists an element y>m, implying that our permutation p is not 
decomposable. 


COROLLARY 4.47 

Ifp is an indecomposable n-permutation, then N(p) always ends in the entry n. 


PROOF Note that the only way for a 132-avoiding n-permutation to be 
indecomposable is for it to end with n. If p is a 132-avoiding n-permutation and n 
is not the last entry, then we may cut it just after the entry n. Then the statement 
follows from Proposition 4.46. 

Now we are in a position to prove Theorem 4.34. 
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FIGURE 4.7 

The unlabeled trees of jV(^ / )=4521367 and^=4621357. 


PROOF (of Theorem 4.34.) Let p be an indecomposable 1342-avoiding n- 
permutadon. Take N(p)=r. By Corollary 4.47 its last element is n. Define F(r) to 
be the unlabeled plane tree S associated with r by the bijection g of Lemma 4.38. 
So g is just the restriction of F to the set of indecomposable 132-avoiding 
permutations. 

This unlabeled tree S is the tree we are going to work with. First, we will write 
the entries of p to the nodes of S. (The reader should recall that we did this in the 
proof of Lemma 4.36, and that the entries of p written to the nodes of S are not to 
be confused with the labels of the nodes.) We will do this in the order specified by 
p and N(p). That is, N(p) is a 132-avoiding permutation, so its entries are in 
natural bijection with the nodes of S as we saw in Lemma 4.38. We then let the 
permutation p(N(p)) A act on the entries of N(p) (written to the nodes of S) to get 
the order in which we write the entries of p to S. Note in particular that the left- 
to-right minima are kept fixed. 

Example 4.48 

Let p=462 1357. Then M(p )=4521367, and the unlabeled plane tree S associated 
with these permutations is shown in Figure 4.7, together with the order in which 
the entries otp are written to the nodes. Note thatj^ and N(p) only differ in the 
transposition (56). This is why it is these two entries whose positions have been 
swapped. 

Now we are going to define the label l(v) of each node v for the new y3(0, 
l)tree T=F(p) that we are constructing from S. As an unlabeled tree, T 
will be isomorphic to S, but its labels will be different. Let i be the zth 
node of T in the postorder reading, the node to which we wrote pi. We 
say that p, beats pj if there is an element p h so that p h , p b pj are written in 
this order (so A<z</) and they form a 132-pattern. Moreover, we say 


2004 by Chapman & Hall/CRC 




In Any Way But This. Pattern Avoidance. The basics. 153 



FIGURE 4.8 

The image F(p) of ^'=4621357. 

that pi reaches p k if there is a subsequence PisPi+au '' ' Pi+a t , pK of entries 
so that i<i + ai<i + a%<---<i + at<k and that any entry in this subse¬ 
quence beats the next one. In particular, if x beats y, then x also reaches y. 


Example 4.49 

Let y^=3716254. Then 7 beats 6, 6 beats 2, therefore 7 reaches 2. Q 
Finally, we set 

l(i)=\{jis a descendant of i (inclusive) so that there is at least one />zfor which pj 
reaches p t } |, 

and let F(p) be the /3(0, l)-tree defined by these labels. A descendant of i is an 
element of the tree whose top element is i. Note that this rule is an extension of 
the labeling rule we have in Lemma 4.36. 

First, it is easy to see that /' indeed maps into the set Indeed, let v be 

an internal node and let v h v 2 , •••, v k be its children. Then l(v)<l + l{vk) 

because there can be at most one entry counted by l(v) and not counted by any 
of its children’s labels, namely v itself. Second, all labels are certainly nonnegative 
and all leaves, that is, the left-to-right minima, have label 0. 


Example 4.50 

In Example 4.48 we have created the unlabeled tree S for /z=4621357. 
Application of the above rule shows that F(p) is the /3(0, l)-tree shown in Figure 
4.8. Indeed, the only 132-pattern of p is 465, and that is counted only once, at 
the entry 6. D 


2004 by Chapman & Hall/CRC 



154 


Combinatorics of Permutations 




FIGURE 4.9 

The tree S and F(p) for ^=58371624. 


Example 4.51 

Let/>=58371624, then we have Jf(p)= 56341278, giving rise to the unlabeled 
tree shown on the left of Figure 4.9. We then compute the labels of F(p) to obtain 
the tree shown on the right of Figure 4.9. Q 

To prove that F is a bijection, it suffices to show that it has an inverse. That is, it 
suffices to show that for any /f(0, l)-tree T £ we can find a unique 

permutation p so that F(p)=F. 

We again claim that given 7, we can recover the node j that has the entry n of 
the preimage p associated with it, and so we can recover the position of n in the 
preimage. 


PROPOSITION 4.52 

Suppose p n lm, that is, n is not associated with tlw root vertex. Then each ancestor of n, 
including n itself, has a positive label. Ifpj=n, then l(n)=Q and thus there is no vertex with the 
above properly. 


PROOF If p,=n, then there is nothing on the right of n to reach, thus l(v) 
enumerates an empty set, yielding p = 0. Suppose now that p n is not the root 
vertex. 

To prove our claim it is enough to show that for any node i that is an ancestor 
of pj=n, there is an entry p k so that !i>i, and n=pj reaches k. Indeed, this would 
imply that the entry pf=n is counted by the label l(i) of i, forcing l(i)> 0. Now let 
a m =pd>af>-- ->ai=l be the left-to-right minima of p so that n is located between 
a r and a r+1 . (If n is located to the right of «i=l, then a-^nx is obviously a 132- 
pattern for any x located to the right of n.) Then n certainly beats all elements 
located between a T and «, +1 as a, can play the role of 1 in the 132-pattern. 
Clearly, n must beat at least one entry y 1 on the right of a T+ 1 as well, otherwise 
p would be decomposable by cutting it right before a r+1 . Uy t is on the right of 
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i, then we are done as y\ can be chosen for p k . If not, then j 1 , must beat at least 
one entry y 2 which is on the other side of a r i+i where y is located between a rl 
and a r i+i for the same reason, and so on. This way we get a subsequence y ), 
y 2 , • • ■ so that n reaches each of the y t , and this subsequence eventually gets to 
the right of i, since in each step we bypass at least one left-to-right minimum. 
Thus the proposition is proved. 

The only problem is that there could be many vertices with the property that all 
their ancestors have a positive label. If that happens, we resort to the following 
Proposition to locate the vertex associated with n. 


PROPOSITION 4.53 

Suppose p^n. Then n is the leftmost entry of p which has the properly that each of its 
ancestors has a positive label. 


PROOF Suppose/^ and wboth have this property and that p l: is on the left of n. 
(If there are several candidates for the role of fi h choose the rightmost one). If p k 
beats an element y on the right of n by participating in the 132-pattern xp k y, y, 
then xp h ny is a 1342-pattern, which is a contradiction. So p k does not beat such an 
element y. In other words, all elements after n are smaller than all elements before 
pk- Still, pi, must reach elements on the right of n, thus it beats some element v 
between p k and n. This element v in turn beats some element w on the right of n 
by participating in some 132-pattern tvw. However, this would imply that tvnw is 
a 1342-pattern, a contradiction, which proves our claim. | 


Therefore, we can recover the entry n of p from T. Then we can proceed as in the 
proof of Lemma 4.36, that is, just delete n, subtract 1 from the labels of its 
ancestors and iterate this procedure to get p. If at any time during this procedure 
we find that the current root is associated with the maximal entry that has not 
been associated with other vertices yet, then there are two possibilities. 


(a) If the tree has only one branch at this moment, then simply remove its root 
(and the maximal entry with it), and adjust the label of the new root so 
that it is the sum of the labels of its children. 

(b) If the tree has more than one branch at this moment, then deleting the root 
vertex will split the tree into smaller trees. Then we continue the same 
procedure on each of them. The set of the entries associated to each of 
these smaller trees is uniquely determined. Indeed, the fact that our current 
tree T' has more than one branch is equivalent to the fact that the current 
partial permutation p becomes decomposable when the maximal element 
(the one associated to the root of T) is removed. We have seen this for 
unlabeled trees in the proof of Lemma 4.38, and we know from Proposition 
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4.46 that p is indecomposable if and only if N(p) is indecomposable. So the 
entries are assigned to the subtrees so that each subtree consists of larger 
entries than the subtrees on its right. 

Therefore, we can always recover p in this way from T. This proves that Pis a 
bijection, completing the proof of Theorem 4.34. 

COROLLARY 4.54 

The number of indecomposable 1342-avoiding n-permutations is 


\D^°T\= tn = 3-2 


(2n — 2)! 

(n + l)!(n — 1)! * 


(4.7) 


PROOF Follows from (4.6) and Theorem 4.34. | 

Computing the numbers 5’„(1342) is now a breeze, (well, if you like generating 
functions). 


LEMMA 4.55 

Let s=S n (134:2) and let H(x) = Yff_ Q s n x n . Furthermore, let F(x) = '22’^= j tnx” 

that is, let F(x) be the generating function of the numbers ofindecomposable 1342-avoiding 
permutations. Then 

^ ~ 1 “ F ( x ) ~~ —8x 2 + 20a; + 1 — (1 — 8x) 3 / 2 ’ ( 4 - 8 ) 


PROOF Tutte [191] has computed the ordinary generating function of the 
numbers t„ and obtained 


F(x) - ^2 t n x n 

n=l 


Y, 3 • 2 n ~ 1 


n =1 


(2n - 2)! 

(n + l)!(n — 1)! 


(4.9) 


_ 8x 2 + 12x-1 + (1-8x) 3 / 2 ,, im 

32x ’ ^ ' 

The coefficients of F(x) are the numbers of indecomposable 1342-avoiding n- 
permutations. Any 1342-avoiding permutation has a unique decomposition into 
indecomposable permutations. This can consist of 1, 2,..., n blocks, implying 
that s n = Y22t= l ti s n-i■ Therefore, F4(x)=ll(l-F(x)) as claimed. 

■ 

It is time that we mentioned the other kind of objects that are in bijection with 
these permutations. These are rooted bicubic maps, that is, planar maps in which 
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each vertex has degree three, there is a distinguished half-edge (the root), and the 
underlying graph is bipartite. Tutte was enumerating these maps (according to 
the number 2(w+l) of vertices) when he obtained formula (4.9), and Cori,Jacquard, 
and Schaeffer then used the y0(O, l)-trees to find a more combinatorial proof of 
Tutte’s result. 

Now that we have the generating function of the numbers 5„(1342), we are in 
a position to obtain an explicit formula for their number. That formula will 
prove Theorem 4.31. 

PROOF (of Theorem 4.31). Multiply both the numerator and the denominator 
of H(x) by (-8x 2 +20x+l)+(l-8x) 3/2 . After simplifying we get 


H(x) = 


(1 - 8x) 3 / 2 - 8x 2 + 20x + l 
2(x + l) 3 


(4.11) 


As (1 — 8x) 3 / 2 = 1 — 12x + Y ^ n >2 3 • 2 n+2 x ra jffip formula (4.11) implies 
our claim. 


So the first few values of 5„(1342) are 1, 2, 6, 23, 103, 512, 2740,15485, 91245, 
555662. 

In particular, one sees easily that the formula for >S’„(1342) given by Theorem 
(4.31) is dominated by the last summand; in fact, the alternation in sign assures 
that this last summand is larger than the whole ritrht hand side if n>8. As 

x . \ | ^_2 o O 

n!(”~ 2 )! < ypr by Stirling’s formula, we have proved the following exponential 
upper bound for 5„(1342). 

COROLLARY 4.56 

For all n, we have 5„(1342)<8’ 1 . 


On the other hand, it is routine to verify that the numbers t n satisfy the recurrence 
4=(8ra-12)4i/(n+l). As we explained immediately after Conjecture 4.9, the fact 
that S„(1342)<8” implies that the limit y/S n ( 1342) exists. Therefore, by the 
Squeeze Principle, we obtain the following Corollary. 

COROLLARY 4.57 

We have 


lim l/S n {lM2) = 8. 

n—SQo 

This result is again striking for two different reasons. On one hand, this is the 
third time that we can compute lim ra _ >00 \/S n (q) for some pattern q. In fact, we 
have seen that 
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4 if q is of length 3, 


lim \/S n (q) = (k — l) 2 if q = 123 ■ ■ ■ k, 

n—yoo 

8 if q = 1342, 


In other words, in every case when we saw an exact answer, the exact answer was 
an integer. In general, however, that does not hold. Present author [41] has 
recendy proved that \/S n ( 1 2453) = 9 + 4 y'2- So these limits are not 

even always rational. 

The other surprise provided by Corollary 4.57 is that 

lim s/S n {lM2) = 8f lim ^/S~( 1234) = 9. 

n —boo n —boo 

That is, even in the logarithmic sense, the sequences S, ,(1342) and 5„(1234) are 
different. This phenomenon is not well understood. If we could understand why 
the pattern 1342 is really so easy to avoid, then maybe we could use that 
information to find other, longer patterns that are easy to avoid. 


4.4.3 The Pattern 1234 


The pattern 1234 is a monotone pattern, therefore Theorem 4.11, that provides 
an asymptotic formula and a very good upper bound for the numbers >S’„(123 ■ • • A), 
applies to it. We would like to point out, however, that using certain techniques 
beyond the scope of this book, Ira Gessel [105] proved the following exact formula 
for these numbers 


S„(1234) = 2 • Y, 


k =0 


2k' 


2 3 k 2 + 2k + 1 — n — 2 nk 
(k + 1 f(k + 2)(n - k + 1 ) ’ 


(4.12) 


The alert reader has probably noticed that the summands on the right-hand side 
are not always non-negative, which decreases the hopes for a combinatorial proof. 
However, a few years later Gessel found the following alternative form for his 
formula [104] 


S„(1234)= 


1 

(n + l) 2 (n + 2) 



(4.13) 


In this new form, all terms are nonnegative, but there is still a division, suggesting 
that a direct combinatorial proof is probably difficult to find. 

We will return to the surprising complexity of Gessel’s formulae in the next 
chapter. 
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4.5 The Proof of The Stanley-Wilf Conjecture 

The goal of this section is to present a proof of the Stanley-Wilf conjecture. In 
order to achieve this goal, we will discuss one more conjecture involving 0-1 
matrices, show why it implies the Stanley-Wilf conjecture, and then prove the 
conjecture on the 0-1 matrices. 


4.5.1 The Fiiredi-Hajnal conjecture 

Let us extend the notion of pattern avoidance to 0-1 matrices as follows. 


DEFINITION4.58 Let A and P be matrices whose entries are either equal to 0 or to 1, and 
let P be of size kxl. We say that A containsPfA has a kxl submatrix Qso thatfP,j=l , then 
Qij=\,for all i andj. If A does not contain P, then we say that A avoids P. 

In other words, A contains Pif we can delete some rows and some columns from 
A and obtain a matrix (What has the same shape as P, and has a 1 in each 
position Pdoes. Note that (lean have more 1 entries than P, but not less. Note 
that if all entries of A are equal to 1, then A contains all matrices Pthat have 
shorter side lengths than A. 


Example 4.59 

(l 00 1 \ 

0 110 


Let A = 


0 0 11 

\ 1000 / 


, and let P = 


00 
1 1 


!. Then A contains P as can be seen 


by deleting everything from A except the intersection of the first and third rows 

_ f ll\ 

with the third and fourth columns. If Q — I ^ ^ 1 then A avoids (f Q 


All matrices in this section will be 0-1 matrices, so we will not repeat that 
condition any more. The famous Fiiredi-Hajnal conjecture, which was originally 
stated as a question, sounds similar to the Stanley-Wilf conjecture. 


CONJECTURE 4.60 [101] [Fiiredi-Hajnal conjecture] Let P be any permutation matrix, 
and letf(n, P) be the maximum number of1 entries that a P-avoidingny.n matrix A can have. 
Then there exists a constant c P so that 
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f(n, P)—c/i- 


4.5.2 Avoiding Matrices vs. Avoiding Permutations 

In this Section, we present Martin Klazar’s argument proving that the Fiiredi- 
Hajnal conjecture implies the Stanley-Wilf conjecture. Interestingly, in order to 
prove this connection between permutations and matrices, Klazar introduces yet 
another notion of avoidance and containment, that of bipartite graphs. 


DEFINITION 4.61 Let G([a], [a]) and H([b], [b’J) be simple bipartite graphs. We say 
that G contains H as an ordered subgraph if there exist order preserving injections f: 
[b]—>[a] andf: [b’]—>[a’] so that if vd is an edge of H, thenf(v)f’ (vj is an edge of G. 

Each w-permutation p=pfr"pn defines a bipartite graph Gp on ([n], fnj) in the 
natural way. That is, Gp has n edges, given by (p b i), for i £ [n]. The following is 
then immediate from the definitions. 


PROPOSITION 4.62 

If the permutation p contains the permutation q as a pattern, tlwn G p contains G ? as an ordered 
subgraph. 

Therefore, if Gp(n) is the number of simple bipartite graphs on ([n], fnj) avoiding 
the graph G p , then we have S„(q)^G p (n). 

Now note that if a bipartite graph G([n], fnj) avoids Gp, then the adjacency 
matrix A ( G) of G must avoid the adjacency matrix of Gp. Assume for the rest of 
this subsection that Conjecture 4.60 is true. Then A(G) can have at most Cjn 
entries equal to 1, that is, G can have at most c P n edges. We are going to show 
that this leaves at most an exponential number of possibilities for G. 

Let us contract G to the smaller bipartite graph Gi that has vertex set 
([fn/2]], [|"n/2]]') as follows. If i and j' are two vertices of Gi from different 

color classes, then let if be an edge of Gj if there is at least one edge between the 
sets of vertices (2z : l, 2z] and {(2/1)', (2/'} in G. 

It is then clear that Gi inherits the G^-avoiding property of G. On the other 
hand, there are at most 15 £t ” /2] different graphs G that can lead to the same graph 
G x as there are 15 possible nonempty subsets of edges between (2z : l, 2z] and {(2/ 
1)', (2/)'} in G, and, by Conjecture 4.60, Gj has at most <[n/2\ edges. That is, we 
have 


G p (n) < 15 c !"/ 2 1 G p ([~n/2~|). 
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Iterating this argument until on the right hand side we have G p { 1)=2, we get 


Gfn)<15 2m . 

p\ s 

So, by Proposition 4.62, we see that 

S.feX 15 2 “- (4.14) 


Therefore, we have proved the following. 

PROPOSITION 4.63 

[133] If the Furedi-Hajnal conjecture is true, then the Stanley-Wifconjecture is also true. 


4.5.3 The Proof of the Furedi-Hajnal conjecture 

We close this Chapter by presenting the recent spectacular proof of Conjecture 
4.60, given by Marcus and Tardos. 

Let P be a kxh permutation matrix, and let A be an nXn matrix that avoids P 
and contains exacdy /yi, P) entries 1 (as defined in Conjecture 4.60). Assume for 
simplicity that n is divisible by Id. The crucial idea of the proof is a decomposition 
of A into blocks. While the simple idea of decomposing a matrix into smaller 
matrices is not new, the novelty of the Marcus-Tardos method is that it decomposes 
A into (p ' p) blocks, which are each of size k 2 xk 2 . 

For (i,j) € [p] x [p-j, let 5^ denote the submatrix (block) of A that consists 
of the intersection of rows (zT)P+l, (z : l)£ 2 +2, ik 2 and columns (j-VjP+l, 
(/T) k 2 +2, ■ ■■fid. We will now contract A into a much smaller matrix B as follows. 
Each entry of B will contain some information about a block of A. The matrix 
B=(b,) is of size p x and 

{ 0 if all entries of S ,,are zero, 

1 if not all entries of Sij are zero. 

PROPOSITION 4.64 

The matrix B avoids P. 


PROOF Assume not, and take a copy P c of P in B. Then considering A, and 
using the fact that Pis a permutation matrix, we can take a 1 from each block of 
A that defined an entry of P,, and get a copy of P in A. | 

The next crucial step is the following definition. 


2004 by Chapman & Hall/CRC 


162 


Combinatorics of Permutations 


DEFINITION 4.65 A block Sij of A is called Wide fit contains a 1 in at least k different 
columns. Similarly, a block is called tall if it contains a 1 in at least k rows. 

Note that a block has Id columns, but it is called wide if at least k of these columns 
contain a 1. 

LEMMA 4.66 

For any fixed j, the set ofblocks Cj = {Si,j |1 < i < \ ofthe matrix A contains less 

than (f — 1) ^ ) + 1 w ^ e blocks. 

PROOF We show that if the statement of the lemma were false, then A would 
contain a copy of P. Indeed, assuming that the number of wide blocks in Cj is at 
least {ff — 1) ) -pi by the pigeon-hole principle there would be a A-tuple of 

integers 1 <c 1 < Cc 2 <-■ ■<c 4 £ Id so that there are k blocks S (ll y, S a . 2 y, ■ ■ •, S ak ,j that 
all contain a 1 entry in column for 1 <z<£. In that case, it is easy to find a copy 
of Pin A, which is a contradiction. Indeed, if the single 1 in column i of Pis in 
row p(i), then choose a 1 from column c, of S ap(i) ,j. As the blocks 
S ait j, Sa, 2 ,ji' " i $a k ,j are positioned in a column, the n entries 1 chosen this 
way will prove that A contains P, which is a contradiction. | 

It goes without saying that the same argument can be made for the array of 
blocks Ri = {Aijll <j < p-}, thereby giving the following lemma. 


LEMMA 4.67 

For any fixed i, the set <f blocks Ri = |1 < j < } °fthe matrix A contains less 

than iff — 1) ^ j -)- 1 tall blocks. 

We have seen that A cannot have too many wide or tall blocks, and Proposition 
4.64 seems to suggest that A cannot have too many nonzero blocks either. Putting 
together these observations, we get the following recursive estimate. 

LEMMA 4.68 

For any kxk permutation matrix P, and any positive multiples n of Id, we have 



PROOF By Proposition 4.64, the number of nonzero blocks is at most 
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/ (p, P)- By Lemmas 4.66 and 4.67, there are at most A T (k — 1)(*~) wide 

blocks, and at most jf(k — 1)(^ ) tall blocks. 

Let us count how many 1 entries the various blocks of A can contribute to f(n, 
P). A wide (or tall) block can contribute at most P entries 1, so the total 
contribution of these blocks is at most 




• k 4 < 2 k 3 


n. 


If a block is neither wide nor tall, then by the pigeon-hole principle, it can contain 
at most (/i-l) 2 entries 1. Multiplying this bound by the number of nonzero blocks 
yields that the contribution of all blocks that are not tall or wide is at most 


Adding all the contributions of all these blocks, the lemma is proved. 


We now have all necessary tools to prove the Fiiredi-Hajnal conjecture. 

THEOREM 4.69 

For all permutation matrices P ofsize hxh, we have 

f(n,P)< 2fc 4 (*Qn. 

PROOF We prove the statement by induction on n, the initial case of n= 1 being 
obvious. (In fact, the statement is obvious when n<B, because then A has at 
most IP entries.) 

Now assume the statement is true for all positive integers less than n, and 
prove it for n. Let n! =[n/k 2 ]Id. Then by Lemma 4.68, we have 


f(n,P) < f(n',P) + 2k 2 n 

<(*-i >V (p>-r 


2k J 


k 2 


ri + 2 k 2 n 


as in the worst case, we fill the part of A that cannot be partitioned into /dx/d 
blocks by entries 1. Applying the induction hypothesis to / (p,^). we get 


f{n,P)<(k- l) 2 


2k«l k V' 


k ) k 2 


< 2 k 2 ((k - l) 2 + k + 1) 

< 2fc 4 ( * "W 


+ 2 k 3 


k 2 


n' + 2 k 2 n 
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since t¥>(k-l) * 1 2 3 4 5 6 7 8 9 +k+\ whenever £>2. | 


The proof of the Stanley-Wilf conjecture is now obvious. We include it because 
we want to show the numerical result obtained. 


COROLLARY 4.70 

For any permutation pattern q of length k, we have 

S n (q) < 


where Cq — 15 2fc4 (\ ). 

PROOF Immediate from (4.14) and Theorem 4.69. | 

The value c q = 15 2 * ( k ) is certainly very high. For instance, if k= 3, then we get 
Cq = 15 27216 , while we have seen that c.=4 is the best possible result. Therefore, 
there is a lot of room for further research here. 


Exercises 

1. Prove that S„(123--k)=S„(123--k(k-l)). Here the pattern on the right is the 
monotone pattern of length k, with its last two entries reversed. Do not use 
Theorem 4.12. 

2. (a) Find a formula for S 4+1 (q) where q is any pattern of length k. 

(b) Find a formula for S t+2 (q) where q is any pattern of length k. 

3. Find a formula for 6’„(132, 312). 

4. Find a formula for S n (p, q), where p and q are patterns of length three. Go 
through all possible choices of p and q. How many different sequences are 
there? 

5. Find a formula for S„(132, 1234). 

6. Prove that/’(w)=5'„(132, 3421)=l+(w-l)2”' 2 . 

7. Prove that g(n)=S„( 132, 4231)=l+(w-l)2”' 2 . 

8. Find a formula for S„(132, 4321). 

9. (a) Find the ordinary generating function X^n>o l - c ’n(3142,2413)x”. 
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(b) Find other pairs of patterns (p, q) so that S„(p, q)=S n ( 3142, 2413). 
(Do not look for the easy way out. We are interested in pairs that 
cannot be obtained from (3142, 2413) by iterating the trivial 
equivalences.) 

10. The result of Exercise 6 shows that S„(132, 3421) is always odd if m> 3. 
Prove this fact by finding an involution on the set enumerated by .S„(132, 
3421) that has exacdy one fixed point. 

11. (a) Let A(w)=.S'„(1324, 2413). Compute the ordinary generating function 

H(x) = E“=o h (n)x n . 

(b) Find other pairs of patterns (p, q) so that S n (p, ^)=5'„(1324, 2413). 
Again, we are looking for nontrivial examples. 

12. In Exercises 6 and 7, we have seen examples for pairs of patterns p and q so 
that S n (p, q) is odd for sufficiently large n. Are there pairs of patterns p and 
q so that Sjp, q) is even for sufficiendy large n ? 

13. Let I„(q) be the number of involutions of length n that avoid the pattern q. 
Find the ordinary generating function for the numbers 7/2143). Note that 
involutions enumerated by /„(2143) are called vexillary involutions. 

14. Are two patterns less restrictive than three patterns? More precisely, let a, 
b, c, d, and e be five patterns of the same length. Is it then true that 

S/a, b)>S/c, d, e) 

as long as (a, b)*( 123—A, A(AT)--T)? 

15. Find an explicit formula for the number of 132-avoiding w-permutations 
that are decomposable into three blocks, each block consisting of an 
indecomposable permutation. 

16. Find a formula for the number of indecomposable 123-avoiding 
permutations. 

17. Find a Injection/"from the set of 231-avoiding n-permutations to the set of 
northeastern lattice paths from (0, 0) to (n, n) that do not go above the 
main diagonal. What parameter of these paths will correspond to the 
number of ascents of the corresponding permutations? 

18. Give an example of two patterns p and q so that S n (p, q) is a polynomial. 

19. Are there any patterns p and q so that S n (p, q)= 0 if n is sufficiendy large? 

20. Let E n (q) (resp. O/q)) denote the number of even (resp. odd) w-permutations 
that avoid a given pattern q. Prove that 

E„(132)= AiAiziMt. 
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FIGURE 4.10 

Two distinct binary plane trees. 


21. Prove that 

£„(231)-0„(231)=£„(312)-0„(312)=(-1)^ C W)/2 . 

22. Prove that 

£„(213) - 0„ (213)=C ( „.] )/2 . 

23. Give an example of three patterns m, q and r so that 

lim s/Sn{m,q,r) 
n —too 

is not an integer. 

24. Assume we know the numbers gC*) (g) of permutation of length n having 
exacdy k fixed points that avoid a certain pattern q. For what other patterns 
q can we obtain the numbers gW directly from these data? 

25. Prove that if £>3, then there exists a pattern of length k so that for n 
sufficiendy large, we have S n {123---li)<.S n (q). 

26. Find a formula for 7„(132). 

27. (a) A binary plane tree is a rooted plane tree with unlabeled vertices in 

which each vertex that is not a leaf has one or two children, and each 
child is either a left child or a right child of its parent. So the two 
binary plane trees shown In Figure 4.10 are different. Prove (preferably 
by a bijection from the set of 132-avoiding w-permutations to the set 
of binary plane trees on n vertices) that the number of binary plane 
trees on n vertices is C„. 
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(b) What parameter of these plane trees will correspond to the number 
of descents of the corresponding 132-avoiding permutation? 

(c) How can we decide from a binary plane tree whether the 
corresponding 132-avoiding permutation has a descent in z? 

28. Let Djk) denote the number of 321-avoiding ?z-permutations that start in 
an ascending run consisting of exactly k elements. Prove that 



29. Prove that 6’,,(1234) and S„(1324) are not asymptotically equal. 

30. (a) Let q be a pattern that starts with the entry 1, and assume that S n (q)<R 

for some positive constant c. Let (j be the pattern that is obtained from 
q by inserting the entry 1 to the front of q and increasing all other 
entries of q by 1. Prove that S n (tf)<(4:c) n , for all positive integers n. 

(b) Keeping the notations of part [(a)], prove that S n (q) < (1 + s/c) 2n - 

31. Let q be a pattern that has an entry x that is a left-to-right maximum and a 
right-to-left minimum at the same time, and assume that S„(q)K d for all n. In 
other words, we have q=LxR for some strings L and R. Let us replace x by a 
pattern p so that the obtained pattern / has form LpR, where all entries of L 
are still smaller than all entries of p, and all entries of p are still smaller than 
all entries of R. State certain conditions for p, then prove that if those conditions 
hold, then there exists a constant if so that S n (q')<TC" for all n. 

32. Let q k = 1 k-\ k-2—2k. Use the methods of Section 4.4 to find a relatively 
small (that is, smaller than the general constant found in Section 4.6) positive 
constant c k so that S n {qu ) < dt. for all n. Does your proof generalize to 
other patterns? 

33. Let q 6 Sk. Prove that S n (q)^c(k- 1)”, for some positive constant c. 

34. For what positive integers n will .S’„(123) be an odd integer? 

35. The bijection F that we constructed in the proof of Theorem 4.34 maps 
the set of indecomposable 1342-avoiding zz-permutations to the set 

of >0(0, l)-trees on n vertices. Let A be the set of 123-avoiding zz-permutations 
that are in the domain of F. Describe F(A). 

36. Find a formula for /„(231). 

37. Let/he the bijection defined In Exercise 17. Let B m be the set of permutations 
in the domain of f that have exactly m inversions. Give a simple description 


of/(SJ. 


38. Find three patterns p, q, and r so that for all positive integers n, we have 
S n (p, q, r)=n. 
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39. Are there infinitely many nontrivial pairs of patterns p and q so that 
Sjp,q)<h (p4 (n) for all n, where h ^ (n) is a polynomial function of n ? By 
nontrivial pairs we mean pairs in which at least one of the two patterns has 
at least two alternating runs. 

40. Find a formula for the number of 132-avoiding w-permutations whose 
longest decreasing subsequence is of length exacdy k+\. 

41. Find a formula for the number Bfa h aaf of all permutations of the 
multiset {l al 2 a2 3“ 3 }that avoid both 123 and 132. 

42. Let us extend the notion of permutation pattern avoidance for words over a 
finite alphabet {1, 2, ■■■, rri\ as follows. We say that a word w contains the 
permutation pattern q=q\ q?y-qf& we can find ddistinct entries in w, denoted 

( 2 a, a d from left to right so that if and only if d<idj. 

We say that a word t on the alphabet M={ 1, 2, •••, m) is k-regular if the 
distance between two identical letters in t is at least k. Let l h (q, m) be the 
maximum length that a A-regular word over M can have if it avoids the 
permutation pattern q. Assume that we know that there exists a constant 
c=Ck,q,m so that 

l k (q, m)<cm. (4.15) 

Prove the Stanley-Wilf conjecture from this assumption. 


Problems Plus 

1. Prove that the number of w-permutations that avoid 132 and have k+\ left- 
to-right minima is £ (”) ( fc ^). 

2. Prove a formula for the number d,(132) of derangements of length n that 
avoid 132. 

3. (a) Let p be a permutation that is the union of an increasing and a 

decreasing sequence. Such permutations are called skew-merged. Can 
we characterize p in terms of pattern avoidance? 

(b) Find a formula for the number of skew-merged w-permutations. 

4. (a) Prove that 


S n (1234,3214) 


4" _1 + 2 


3 
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(b) Find other pairs of patterns p and q so that the equality Sjp, q)= 
>S’„(1234,3214) holds for all n, besides those pairs obtained from (1234, 
3214) by trivial symmetries. 

5. Prove that 

S„(123,132, (k - 1 )(k -2) ■■■!&) = S n ( 123,132,123 • • ■ k) = F^, 

where F^ = 0 is the Mi generalized Fibonacci number defined F, = 0 by if 
n < 0, F[ k) = 1, and Ei k) = £*=1 F i% for n > 2 - for n ~ 2 ’ 

6. Prove that for all positive integers n, we have 5w(1243, 2143, 321)= 

n + 2© 

7. Prove that for all positive integers n, we have S, ,(1243, 2143, 231)= 
(w+2)2”' 3 . 

8. Let us generalize the notion of pattern avoidance as follows. A generalized 
pattern is a pattern in which certain consecutive elements may be required 
to be consecutive entries of a permutation. For instance, a generalized 31- 
2 pattern is a 312-pattem in which the entries playing the role of 3 and 1 
must be consecutive entries of the permutation (this is why there is no 
dash between 3 and 1), but the entries playing the role of 1 and 2 do not 
have to be consecutive (this is why there is a dash between 1 and 2). 

For example, a 3-1-2 generalized pattern is just a traditional 312-pattern, a 
2-1 generalized pattern is an inversion, while a 21 generalized pattern occurs 
when the permutation has a descent. 

Prove that S„(1-23)=I>,„ where B„ is the nth Bell number, denoting the number 
of all partitions of the set [n]. 

9. Is generalized pattern avoidance always stricter than traditional pattern 
avoidance? That is, let ^be a traditional pattern, and let <j be a generalized 
pattern so that <j becomes q if all the consecutiveness restrictions are released, 
but there are some consecutiveness restrictions. Is it then true that S n (q)l>S n (cf) 
if n is large enough? 

10. For a permutation/;, and a generalized pattern q, let q(p) denote the number 
of occurrences of q 'mp. So the examples of Problem Plus 8 can be written 
as 

d(p)=2\(p), and i(p)={ 2-1) (p). 

Find a similar expression for maj(p). 

11. In Exercise 42, we proved the Stanley-Wilf conjecture from formula (4.15), 
which we assumed without proof. Prove formula (4.15). 
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12. Let q be any pattern of length k-3. Prove that for all n, we have S n ((k-V){k- 
2)kq)=S n ({k-2)k{k-l)q). 

13. Prove that for any permutation patterns q of length k, we have 

L(q) - lim s/Sfq) > 

n—xx> C° 

14. Let q be a decomposable pattern, let L(q) be defined as in the previous 
Problem Plus, and let c( be the pattern obtained by prepending q by the 
entry 1 (and increasing all other entries by 1). 

(a) Prove that L(q') > 1 + L(q) + 2fL(q)- 

(b) Prove that in the special case when q starts in the entry 1, we have 
L(q r ) = 1 + L(q) + 2 fL(q)- 


Solutions to Problems Plus 

1. First, note that in a 132-avoiding permutation, the entry p l is a left-to-right 
minimum if and only if either t=l, or z'-l is descent. Therefore, we are 
looking for the number of 132-avoiding w-permutations with k descents, 
or, by taking the reverse, 231-avoiding w-permutations with k ascents. We 
know from Exercise 17 that the number of such w-permutations is equal to 
the number of northeastern lattice paths from (0, 0) to (n, n) that have k 
north-to-east turns and that never go above the main diagonal. A 
comprehensive survey of lattice paths of this and more general kinds can 
be found in [137]. 

In order to count these paths, note that the north-to-east turns of such a 
path completely determine the path. It therefore suffices to count the possible 
positions for the /i-clcmcnt set of north-to-east turns of such a path. Let (a,, 
bj be the coordinates of the ith north-to-east turn of a northeastern lattice 
path r; then the vector 

«i, «2,-, a h b h bn,-, b k (4.16) 

completely determines r. Disregard for the time being the requirement that 
r do not go above the main diagonal. Then we have 


0' a 1 <a 2 <- • w-1 


and 


1 < bfb^f. ,.<Cb t <n. 
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Clearly, all vectors (4.16) satisfying these conditions define a northeastern 
lattice path with k turns, proving that the number of these lattice Paths is 

©©■ 

Now we have to count the vectors (4.16) defining a lattice path that goes 
above a main diagonal, that is, we have to determine for how many vectors 
(4.16) there exists an index i so that a<db,. Let s be such a vector, and let z be 
the largest index for which a,<b r Define 

f(s)=(b v b 2 , b. p a M , a k , a v a f b f -, b). 

Then f(s) satisfies the chains of inequalities 

1 ■■•<«. <•••<« <re-l 

12 z-1 2+1 k 

and 

,<•••<£< re-1. 

1 Z 2 2+1 « 

Therefore, we have (£+ 1 ) (&_i) possibilities for/"(is). As yds a bijection (this 
has to be shown), this means that this is also the number of possibilities for 
s. So the number of northeastern lattice paths with k north-to-east turns 
that do not go above the main diagonal is 

( n \f n \ / « + 1\ ( n - 1, _ 1 f n\ / n \ 

\A/ \k) \k + lj \k — l) n \fc,/ \k + l) 

These numbers are often called the Narayana numbers, and are denoted 
A(n, k). 

2. It is proved in [165] that 

1 n ~ 2 /_i \ * 

d„(132) = - ■ ^ 

i=0 ^ ' 

The numbers d n are called the Fine numbers. An alternative formula for 
these numbers is 

2n - 2k - 1\ _ /2 n - 2k - 1 
ft — 1 ) \ n 

3 (a) It is proved in [174] that the skew-merged permutations are precisely 
the permutations that avoid both 2143 and 3412. 

(b) M.Atkinson [9] proved that the number of these permutations is 
5 „( 2143 , 3412 )=( 2 ;)-£ 2— 1 

' ' m—n ^ ' 


dn( 132)= ( 
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4. (a) This result was proved in [139]. 

(b) It is proved in [139] that 

-——!— = S„(4123,3214) = S n (2341,2143) 

O 

= >(1234,2143). 

5. This result was first proved in [82]. It is also proved as a corollary to more 
general work in [192]. 

6. This result is due to A.Reifegerste [163], who explored the connections 
between permutations counted by Schroder numbers and lattice paths. 

7. This result is due to A.Reifegerste, ibid. 

8. The concept of generalized patterns was introduced in [15]. This result 
was proved in [60] in two different ways, together with many other results 
concerning short generalized patterns. One way is to recall that the Bell 
numbers satisfy the recursion B(n + 1) = >> B(i) ("), and then 
showing that the 1-23 avoiding permutations satisfy this same recurrence. 
The other way is a direct bijective proof, based on a way of writing each 
partition in a canonical form similar to what we have seen for permutations 
in Chapter 3. (Write each block with its smallest element first, and then in 
decreasing order, then order the blocks in decreasing order of their smallest 
elements.) 

9. No, that is not true, and one can find a counterexample using any of the 
four non-monotone patterns of length three. For instance, a permutation 
contains 132 if and only if it contains 13-2. The “if’ part is obvious. For 
the “only if” part, let acb be a 132-pattem in p, and assume that the distance 
between the positions of a and c is minimal among all 132-patterns in p. 
That implies that if there is an entry d located between a and c, then d 
cannot be less than a (for deb), cannot be larger than b (for adb ), and cannot 
be between a and b in size (for deb). This means there cannot be any entries 
among a and c. 

10. This result is from the paper that introduced generalized patterns [15]. In 
that paper, the authors showed that essentially all Mahonian statistics in 
the literature can be expressed by generalized patterns. For the major index, 
one just has to add the descents of p, in other words, count the entries that 
precede a descent, then add these numbers for all descents. With this in 
mind, it is straightforward to see that 

maj(p)={ 1-32) £>(2-31) £>(3-21) £>(21) (p). 

11. This result can be found in M.Klazar, The Fiiredi-Hajnal conjecture implies 
the Stanley-Wilf conjecture, Formal Power Series and Algebraic Combinatorics, 
Springer, Berlin, 250-255, 2000. In that paper, the author in fact proves 
that formula (42) and the Fiiredi-Hajnal conjecture are equivalent. 
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12. See [175], where the authors prove this result introducing the interesting 
notion of shape-Wilf-equivalence. For two patterns q and <j, it is necessary 
for S n (q)=S n (c[) to hold for all win order for the patterns to be shape-Wilf- 
equivalent, but it is not sufficient. 

13. This result was proved by P.Valtr, but was first published in [130]. One 
can assume without loss of generality that q is indecomposable, and then 
build a sufficient number of decomposable permutations in which each 
block avoids q. 

14. (a) See [41] for a proof. 

(b) This follows from part (a), and the proof of part (b) of Exercise 30. 
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5.1 Polynomial Recursions 

5.1.1 Polynomially Recursive Functions 

In the previous chapter, we spent considerable time and effort to find out how 
large the numbers S n (q) are. In this chapter, we will mostly concentrate on how nice 
they are, or rather, how nice the sequence \S n (q)} lSH is. By abuse of language, we 
will often refer to this sequence as the sequence S n (q). 

First, of course, we should define what we mean by “nice.” We have already 
made one important definition, that of P-recursive (or polynomially recursive) 
sequences in Exercise 29 of Chapter 1, but for easy reference we repeat that 
definition here. 

DEFINITION 5.1 We say that a sequence f: N—»C is called P-recursive if there exist 
polynomials P 0 , Pi , ■ ■ •, Pf. £ Q[n], with P k ¥^ 0 so tlmt 

P k (n+h)f(n+k)+P k fn+kA]fn+k-l)+---+P l fn)f(n)=0 (5.1) 

for all natural numbers n. 

This definition, and some of the most important theorems in the theory of P 
recursive sequences, can be found in [177], which is the earliest paper completely 
devoted to this subject. Chapter 6 of [180] is a comprehensive source for results 
on Precursiveness. We do not want to duplicate existing literature on the topic, 
so we will not prove theorems that belong to the general theory of Precursiveness. 
There will be one exception to this, Theorem 5.12. 

See the exercises following Exercise 29 of Chapter 1 for some examples of P 
recursive sequences. 


Example 5.2 

The sequence of Catalan numbers is Precursive. Therefore, S„ (q) is Precursive 
for all patterns q of length 3. D 
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PROOF We have 

C n __ ( 2 ") n (2 n)l(n — l)!(n — l)!ra 4n ■*- 2 
Cn-i n + 1 Cn-i) (2n — 2)!n!n!(n + 1) i§ + 1 

therefore (w+1) ^-(4^-2) C b T= 0, providing a polynomial recursion. 

■ 

At this point, the reader should try to prove that the sum of two P-recursive 
sequences is also A recursive. 

Example 5.2, and some other examples of P-recursive sequences Sjq), such as 
5 n (1234), or in general, S n (123-P), suggested the following conjecture. 

COJVJECTURE 5.3 [105] [Gessel, 1990] For any permuta tionpattern q, the sequence 
Sfq) isP-recursive. 

Six years later, J.Noonan and D.Zeilberger looked at a larger class of sequences. 
Instead of trying to avoid a pattern, they wanted to contain it exaedy r times, 
where rwas a fixed nonnegadve integer. Numerical data and some theorems that 
we will cover in Chapter 7 led them to the following conjecture. 

COJVJECTURE 5.4 [Noonan, Jeilberger, 1997] Let S v (q) be the number of n-permutations 
that contain the pattern q exactly r times. Then for any fixed r and q, thefunction S„fq) is 
polynomially recursive. 


5.1.2 Closed Classes ofPermutations 

It seems that Conjecture 5.4 is much more ambitious than Conjecture 5.3. Indeed, 
the latter seems to be a special case of the former, namely the special case in 
which r= 0. However, certain versions of the two conjectures are actually equivalent 
as has been proved by Atkinson [10]. In order to understand Atkinson’s result, 
we need some new notions. We will use some very basic definitions from the 
theory of partially ordered sets. In the unlikely case that the reader has not seen 
these notions before, they can be found in most introductory combinatorics 
books, such as [27], 

Let P be the infinite partially ordered set of all finite permutations ordered 
by pattern containment. That is, in this poset we have p^ P q if q contains p as 
a pattern. This Pis an interesting poset and we will learn more about it in 
Section 7.2. 

DEFIJVlTlOJV 5.5 Let C be a class consisting of finite permutations. We say that C is 
a closed class of permutations f q £ C and p< P q imply p £ C ■ That is, a closed 
class is just an ideal of P. 
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As P is the only poset we consider in this Chapter, there is no danger of 
confusion, and therefore we are going to write p^q instead o{p< p q. 

Example 5.6 

Let p be any permutation. Then the class C(p) of all ^-avoiding permutations is 
closed. Q 

PROOF Indeed, let q be a /^avoiding permutation. If there existed q’<q so that 
/Ki^held, then by transitivity, we would have p'Cq'Cq, a contradiction. I 

It is straightforward to check that the intersection of closed classes, as well as the 
union of closed classes, is always closed. So, for instance, the class C(p h pP) of 
permutations avoiding both pi and p 2 is closed. 

Recall that in a partially ordered set, an element x is called a minimum if it is 
smaller than all other elements of the poset, while the element y is called minimal 
if no element of the poset is smaller than y. So a poset can have any number of 
minimal elements, but only one or zero minimum elements. 

Just as any ideal in any poset, a closed class C is determined by the minimal 
elements of the complement of C. These minimal elements obviously form an 
antichain. This antichain is called the basis of C. 

Example 5 .7 

The basis of the closed class C(p) is the one-element antichain consisting of p. 0 

Let Cbe a closed class that has basis p h p 2 , p/,. The reader is invited to prove 

the simple fact that then C=C(pi, p 2 , •••, pp must hold, that is, C must be the class 
of all finite permutations avoiding all p it for i £ [fc]. 

Taking a second look at Conjecture 5.3, we see that it in fact claims that if f(n) 
is the number of ra-pcrmu unions of a closed class C that has a one-element basis, 
then f(n) must be a P-recursive function. A stronger version of this conjecture 
that is still supported by numerical evidence is the following. 

CONJECTURE 5.8 For any permutation patterns qi, q 2l ■■■, q k , the sequence S n (q h q 2 , 
qp is P-recursive. In otherwords, iff(n) counts the elements of a finitely based closed class of 
permutations that have length n, thenf(n) is P-recursive. 

In other words, we replaced the condition of having a one-element basis by the 
condition of having a finite basis. There are closed classes of permutations that 
do not have a finite basis as P contains several infinite antichains. We will see 
some of these in Chapters 6 and 7. 

Let us strengthen Conjecture 5.4 similarly. Let r v r 2 , —, r t be nonnegative 
integers, and let q v q 2 , q k be permutation patterns. The class of permutations 
that contain at most r copies of q t for all i £ [&] is obviously closed. We will denote 
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this class by M(r v r 2 , —, r k , q v q 2 , —, qj, or, for shortness, M( r, q). Now the stronger 
version of Conjecture 5.4 can be stated as follows. 

C ONJECTURE 5.9 For all non-negative integers r v r 2 , —, r h , and all distinct patterns q v 
q 2 , —, q k , the sequence g(n) counting n-permutations in M(r, q) is P-recursive in n. 

It is easy to see by the Principle of Inclusion-Exclusion that Conjecture 5.9 is 
equivalent to the conjecture that the numbers h(n) of n-permutations containing 
exactly r copies of q t for all i is also P-recursive. 

We are now ready to state and prove the result of Atkinson we promised. 

THEOREM 5.10 

[10] Conjecture 5.8 and Conjecture 5.9 are equivalent. 

PROOF It is obvious that Conjecture 5.9 implies Conjecture 5.8. In order to 
show the other implication, it suffices to prove that M( r, q) is always finitely 
based. 

Let 7l be a permutation that is minimal in the complement of M(r, q). Then 
there is an i so that n contains at least r+1 copies of q t , for some i (E [A;]. Take the 
union U of these r+1 copies, then U is also not in M{ r, q). Therefore, by the 
minimality of iz, we must have n= U. This implies that the size of n is bounded (at 
most max i (r+l)|< 5 'J), so the number of possible 7ris bounded, and therefore M(r, 
q) is finitely based. | 


5.1.3 Algebraic and Rational Power Series 

Sometimes the generating function of a sequence has a much simpler form than 
the sequence itself. The following notion of d-finiteness, and the theorem after 
that, show how we can use these simple generating functions to prove P- 
recursiveness. 

DEFINITION 5.11 We say that the power series u(x) (E C[[x)]is dfnite fthere exists a 
positive integer d and polynomials P 0 (n),pfn), —,p d (n) so that p d stO and 

p d (x)u^ d \x) +Pd~i{x)v^ d ~ l) {x) H-b pi(x)u'(x) +p 0 {x)u{x) = 0, (5.2) 


HereyS^ — 4 ^- 
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We promised that we will prove one theorem from the theory of /'’-recursive 
functions. We are going to fulfill that promise now. The theorem in its explicit 
form is due to Richard Stanley. 

THEOREM 5.12 

The sequence■■■ is P-recursive if and only fits ordinary generatingfunction 

OO 

u (®) = Yi f( n ) xn (5.3) 

n=0 


is d-jinite. 

PROOF 

• First suppose u is d-finite, then (5.2) holds with/dH). Fix i<d. We start by 
finding an expression for p,(x) involving linear combinations of th ef(n+t) 
with polynomial coefficients. Differentiate both sides of (5.3) i times and 
then multiply both sides by d to get 

aduW = ^(n + i - j)if(n + i - j)x n . 

n> 0 

Here (m) =m(m-Y)--{m-i+Y). Now let a.be the coefficient of dinpfx). Multi¬ 
ply both sides by a, then repeat the entire procedure for each nonzero 
coefficient of p (x), and add the obtained equations. We get that 

Pi(x)u {i) = Yi (ll /(« + t)q t (n)) x n , 

n >0 V t ) 

where the q (n) are polynomials in n, and the sum in the parentheses is 
finite. 

Repeating the above procedure for all d, and then adding all the 
obtained equations, we get an equation that is similar to the last one, 
except that on the left hand side, we will have X(i=o Pi( x )u ll \ which is, 
by (5.2), equal to zero. If we compute the coefficient of x n+k on both sides 
of the last equation and equate the two expressions, we get an equation 
that involves only linear combinations of some f(n+t), 0 with 
coefficients that are polynomials in n. Therefore, this equation can be 
rearranged to yield a polynomial recurrence for/’ This recurrence will 
not be 0=0 as p d t^O. 

• Now suppose g(n) is P-recursive in n, so (5.1) holds. Note that for any 
fixed natural number i, the polynomials (n+i)j,j>0, form a C-basis for 
the vector space C[n], In particular, p,(n) is a linear combination of 
polynomials of the form (n+if. Therefore, using generating functions, 
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P I (n)f(n+i)y?‘ is a linear combination of generating functions of the 
form 

^(n + i) i /(n + z)a;” ^ 

n> 0 

with complex coefficients. 

Compare formulae (5.4) and (5.5). We see that the left-hand side of (5.5) 
almost agrees with x n u ijl that is, they can only differ in finetly many terms 
with all negative coefficients. Let the sum of these terms be 
Ri(x) £ x _1 K[x~f a Laurent-polynomial. If we multiply (5.1) by x” and 
sum over all nonnegadve n, we get 


0 = 



<lijxi 1 


+ R(x). 


(5.6) 


Here the sum is finite by the definition of ^-recursiveness and R(x) is a 
Laurent-polynomial. If we multiply both sides by xf where q is sufficiently 
large, the terms with negative exponents will disappear and we get an 
equation of the form (5.2). 


LEMMA 5.13 

The product of two d finite power series is d finite. 

PROOF See [180], page 192. | 

A useful consequence of Theorem 5.12 and Lemma 5.13 is that the convolution of 
two /'’-recursive sequences is P-recursive. 

LEMMA 5.14 

Let [f(k)} h and {g(m)} m be twopolynomially recursive sequences, and let 

n 

h (n) = Yj f( k )9( n ~ k )- (5.7) 

k= 0 


Then [h(n)} n is a P-recursive sequence in n. 

PROOF Let F(x), G(x), and H(x) denote the ordinary generating functions of the 
sequences j f(k)\, [g(m)} m , and [h(n)} x . Then F(x) and G(x) are both d-finite by 
Theorem 5.12. It is well-known that F(x)G(x)=H(x), so H(x) is d-finite by Lemma 
5.13, therefore [h(n)} n is P-recursive. 
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Note that by repeatedly applying Lemma 5.14, we get the statement that the 
convolution of several 7 J -recursive sequences is always P recursive in the sum of 
the variables. 

Knowing that a function is P-recursive is helpful, but often not sufficient to 
determine the function itself by interpolation. This is because we often do not 
know the degree of the recursion, or the degrees of the polynomials that appear 
in the recursion. Therefore, it is certainly useful to look for other, stronger 
properties that the sequence S n (q) has, at least for some q. 

DEFINITION5.15 Theformalpower series f E C [[a;]] is called algebraic if there exist 
polynomials Pfx), P\(x), ■ ■ ■, Pd(x) € C [x\ that are not all equal to zero so that 


P a (x)+P l (x)f(x)+-+P i (x)f(x)=Q. (5.8) 

The smallest positive dfor which such polynomials exist is called the degree off. 

Example 5.16 

Let f(x) = 1 ~ , the generating function of the Catalan numbers. 

Rearranging the previous equation as 1 — 2x f(x) = s/1 — 4x then taking squares 
and rearranging again, we see that 

xf (x)-}f(x)+x= 0, 

so /"is algebraic of degree 2. [j 

The following lemma shows the connection between algebraic and d-finite generating 
functions. 


LEMMA 5.17 

Iff £ C[[x]] is algebraic, then it is d-finite. 

PROOF See [180], page 190. | 

Note that the reverse of Lemma 5.17 is not true. The reader should try to find 
an example of a power series that is d-finite but not algebraic, then check 
Exercise 9. 

Even if having an algebraic generating function is a stronger property than 
being P recursive, sometimes it is easier to prove the latter by proving the former. 

Example 5.18 

The sequence S„(1342) is P-recursive. Q 
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PROOF We have seen in (4.8) that the ordinary generating function H(x) of this 
sequence satisfies 


H(x) = 


32 x 

—8x 2 + 20a; + 1 - (1 - 8a;) 3 / 2 ' 


We claim that H(x) is algebraic. To see this, it suffices to show that Z(x) = jjjyj 
is algebraic, because we can multiply both sides of the polynomial equation satisfied 
by Z(x) = jjhr y by a power of H(x), and obtain a polynomial equation satisfied 
by H(x). On the other hand, routine transformations show that 


(32x^(x)+8^-20x-l) 2 ={l-8x) 3 , 


so Z( x ) does satisfy a polynomial equation. | 

We would like to point out that the ordinary generating function of the sequence 
S„(1234) is not algebraic (see Problem Plus 10). So yet again, patterns of length 
four turn out to be quite surprising. The monotone pattern 1234 is not only not 
the easiest or the hardest to avoid, it is also not the “nicest”! 

It is known that 5J123—A) is /^recursive for any k. We will prove this result in 
our chapter on Algebraic Combinatorics, when we learn about the interesting 
connections between permutations and Standard Young Tableaux. 

An even smaller class of power series is that of rational functions. A rational 
function is just the ratio of two polynomials, such as A rational function 

is always algebraic of degree 1, (as multiplying it by its denominator we get a 
polynomial), and therefore, d-finite. 


5.1.4 The P-recursiveness of S nr (132) 

In this subsection, we present the currently strongest result of the field by proving 
the following theorem. 

THEOREM 5.19 

Let r be any fixed nonnegative integer. Then the sequence 5 V (132) is P-recursive. Moreover, the 
ordinary generating function of this sequence is algebraic. Even more strongly, this generating 
function is rational in the variables x and fil — 4x. 

The proof of this theorem will have a quite complicated, but elegant structure. So 
that we do not get lost in the details, it is important to get an overview of our 
goals first. 

Note that no matter how large the number r is, it is a fixed number. On the 
other hand, n is changing, and will eventually be much larger than r. So the 
requirement that our permutations have only r copies of 132 really means that 
they have these copies in exceptional cases only. 
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FIGURE 5.1 

A generic permutation and its colored entries. 


In particular, every time there is an entry on the left of the maximal entry n that 
is smaller than an entry on the right of n, a 132-pattem is formed. This simple 
observation is crucial for us. Therefore, we introduce special terminology to 
discuss it. 

Entries of an n-permutation p on the left of the entry n will be called front entries, 
whereas those on the right of n will be called back entries. Front entries of p which 
are smaller than the largest back entry of p will be called black entries, whereas 
back entries of p larger than the smallest front entry of p will be called gray entries. 
Note that not all entries will have colors. See Figure 5.1 for an illustration. 

Why are we coloring our entries? First, any black entry is smaller than any 
front entry which is not black, while any gray entry is larger than any back entry 
which is not gray. In other words, black entries are the smallest front entries, 
while gray entries are the largest back entries. Moreover, any black and any gray 
entry is part of at least one 132-subsequence. Indeed, take any black entry x, the 
entry n, and any back entry larger than x. A dual argument applies for gray 
entries. Finally, if a 132-subsequence spans over the entry n, that is, it starts with 
a front entry and ends with a back entry, then it must start with a black one and 
end with a gray one. 

Starting now, we are going to partition, and partition, and partition. That is, 
we will partition the set of w-permutations that contain r copies of 132 into many 
classes. How many classes? A lot, but the number of classes will not depend on 
n, just on r, so it will be a fixed number. Then we are going to show that within 
each class C, the number of permutations S l32rC (n) that are in class C and have r 
copies of 132 is a polynomially recursive function of n. With that, we will be done 
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by summing over all classes C. Indeed, the sum of a fixed number of Precursive 
functions is P-recursive. 

Let us start this partitioning. 

(a) As we said above, any colored entry is part of at least one subsequence of 
type 132 that spans over n. Therefore if p has exactly r subsequences of 
type 132, and B (resp. G) denotes the number of black (resp. gray) entries, 
then max (B, G) <r. This implies that we have at most r 2 choices for the 
values of B and G. 

(b) Once the values of B and G are given and we know in which position the 
entry n is, then we only have a bounded number of choices for the set of 
black and gray entries. Indeed, if x is the smallest black entry, then x is 
larger than all but G back entries. Thus, if n is in the ith position, then x>n- 
i-G. On the other hand, x is the smallest front entry, thus x<n-i+i. Similar 
argument applies for the largest gray entry. 

(c) Finally, there is only a bounded number of positions where a black entry can 
be. Indeed, if xis black and y>x is aback entry, then xzy is a 132-pattern for 
any front entry z on the right of x which is not black (and thus, is larger than 
any back entry). Recall that x nyis such a pattern as well. Thus, if / is the 
number of such (x, z) pairs, then we have t+G^r. In particular, the distance 
between any black entry and the entry n cannot be larger than r. 

The following definition makes use of the observations we have just made: 

DEFINITION 5.20 We say that the n-permutations pi and p 2 are in the same strong 
class if they agree in all of t!w following: 

• the position of t!w entry n 

• the set of black entries 

• the set of gray entries 

• the patternformed by the gray entries 

• the position of the black entries 

• the pattern starting with the leftmost black entry and ending with t!w entry n. 

In other words, permutations of the same strong class agree in everything that 
can be part of a 132-pattem spanning through the entry n. Note that once the 
position of n is given, there is only a bounded number of possibilities for the 
strong classes. 
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DEFINITION5.21 Letp be an n-permutation. The subsequence ofp consisting of 


• all black and gray entries and 

• all front entries which are preceded by at least one black entry and 

• the entry n 

is called the fundamental subsequence ofp. 

This means that permutations of the same class have fundamental subsequences 
of identical type, and these subsequences are in the same positions in every 
permutation of a given class. 

DEFINITION 5.22 The classes C and C’ are called similar f their permutations have 
fundamental subsequences of the same type. 

Thus in this case the subsequences do not need to be in identical positions. 

Example 5.23 

The classes containing the permutations 34152 and 42513 are similar. Q 


5.1.4.1 The Inductive Proof 

Our proof of Theorem 5.19 will be an induction on r. 


5.1.4.2 The initial step 

To make our argument easier to follow we introduce some new notions: 

DEFINITION 5.24 Let q be a pattern. Then to insert the entry y to the jth position of q 
is to put y between the j-lst and t!w jth entry ofq and to increase the value of any entry vfor 
which originally vfry held by 1. 

The deletion ofan entry is obtained similarly: erase the entry and decrease all entries larger 
than it by 1. 

LEMMA 5.25 

Let q be any pattern of length k. Then the number Cfn) of 132-avoiding npermutations that 
end with a subsequence of type q is a P-recursivefunction ofn. 

PROOF The proof is by induction on k. If ^=0, then Cfn)=C n — ( 2 ”) / (n +1) 
the nth Catalan-number and we are done by Example 5.2. Suppose we know the 
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statement for all subsequences of length k-\. We will prove it for the subsequence 
q, which has length k. 

If q is not 132-avoiding, then clearly C (n)= 0. So we can suppose that ry is 132- 
avoiding. Now we consider two separate cases. 

(a) If the first element q 1 of q is not the smallest one, then the entry 1 of our n- 
pcrmuiation p cannot be on the left of (j\, thus in particular, the entry 1 of 
p is one of the last k entries, which form a subsequence of type q. Then it 
must be the smallest of these last k entries, thus we know exactly where the 
entry 1 of p is located. Let us delete the smallest entry of q to get the 
subsequence q. Apply the induction hypothesis to q ’ to get that G q - (n- 1) is 
P-recursive. Then insert 1 to its original place to see that C rl (n) is P-recursive. 

(b) If qi is the smallest element of q, then it is easy to apply what we have just 
shown in the previous case. Let q" be the subsequence obtained from q by 
deleting q } . Moreover, let r 2 , r 3 , r k (resp.) be the subsequences whose last 
k-1 elements determine a subsequence of type q" and whose first elements 
(resp.) are 2, 3, —, k. Then it is obvious that 

k 

C q (n) = C q n (n) - c <u ( n ) (5.9) 

t=2 

The first term of the right-hand side is P-recursive by induction and 
the second one is P-recursive by the previous case, and the lemma is 
proved. 

This completes the proof. We will refer to the method applied in case (b) as the 
complementing method. 

COROLLARY 5.26 

Let q be a pattern of length k. Then the number Kfn) of 132-avoiding n-permutations in 
which the k largest entriesform a subsequence of type qis a P-recursivefunction ofn. 

PROOF This is true as K q {n) — C q -i(n). Indeed, taking inverses turns 
the last k entries of a permutation p into the k largest entries of the 
permutation p x . 


5.1.4.3 The induction step 

Now would be the time to carry out the induction step of our inductive 
proof. It turns out that this is easier to do if we generalize our statement even 
further. 
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THEOREM 5.27 

Let q be any pattern of length k. Then the number Cq,r(n) of n-permutations which contain 
exactly r subsequences of type 132 and end with a subsequence of type q is a P-recursive 
junction of n. 

Note that Theorem 5.19 is a very special case of Theorem 5.27. Also note that 
equivalently we can state the theorem for the numbers K v (n) of w-permutations 
which contain exactly r subsequences of type 132 and in which the largest k 
entries form a subsequence of type q. 

PROOF Induction on r. If r= 0, then Theorem 5.27 reduces to Lemma 5.25 and 
Corollary 5.26. 

The inductive step will be carried out in a divide and conquer fashion. For shortness, 
we call the statement of Theorem 5.19 the weak statement and we call the statement 
of Theorem 5.27 the strong statement. 

For our induction proof, we have to show that the strong statement for r-1 
implies the strong statement for r. We do this in two parts. 

(a) We show that the strong statement for (r-1) implies the weak statement for 
r, then 

(b) we prove that the weak statement for r implies the strong statement 
for r. 

Let us follow up on these promises. 

(a) Assume now that the weak statement holds for r-1. Choose any class C of 
w-permutations. Suppose the fundamental subsequence type of C contains 
exacdy s subsequences of type 132, where ,s<r. 

• Suppose j> 1. How can a permutation in C contain 132-patterns which 
are not contained in the fundamental subsequence? Clearly, they must 
be either entirely before the entry n or entirely after it. If there are i 
such subsequences before n and /such subsequences after n, then i+j+s=r 
must hold. Denote by the pattern of all front entries in the fundamental 
subsequence and by eg the pattern of all back entries there. Then with 
the previous notation we have 

f[n v n 2 , q v q 2 , i,j, s)=C ql .{nfKq Vj {n 2 ) 

such permutations, where n, (resp. n 2 ) denotes the number of front (resp. 
back) entries which are not in the fundamental subsequence. Indeed, en¬ 
tries of the fundamental subsequence are either the rightmost front entries, 
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or the largest back entries. We know by induction that C f i,,(wi) is P-recursive 
in ni and is Precursive in n 2 . Therefore, their convolution 



ni+ri2=n 




(5.11) 


ni+«2=n 


is Precursive in n. Clearly this convolution expresses the number of w-per- 
mutations with exactly r subsequences of type 132 in all classes similar to C. 

It is clear now that we have only a bounded number of choices for i, j and 
j so that i+j+s=r, thus we can sum (5.10) for all these choices and still get 
that 



(5.12) 


is Precursive in n. (Recall that i>0, thus we can always use the induction 
hypothesis). Summing (5.12) for all q and q 2 we get that 


/(«) = f( n ^ i»92) 


(5.13) 


91,92 


As in this section, the only pattern we study is 132, we simplify our nota¬ 
tions a little bit by writing Sfn) for 5 (132). 

• Now suppose j= 0. Then any 132-subsequence must be either entirely 
on the left of the entry n or entirely on the right of n. Moreover, the 
position of n completely determines the set of the front and back entries. 
If n is in the z-th position, and we have j 132-subsequences in the front 
and r-j in the back, then this gives us 


g(hj)=Spr^)Sjn-i) 


(5.14) 


permutations of the desired kind. If 1 then the induction hypoth¬ 

esis applies for Sj and S r .j, therefore, after summing (5.14) for all i, 


9( n ) = Y2 S “ l) s r-j ( n - *) 


(5.15) 


is P-recursive in n, II /=0 or j=r, then we cannot apply the induction hypothesis. 
By a similar argument as above we get nevertheless that in this case we have 



(5.16) 


?z-permutations with exactly r 132-subsequences. (We remind the reader 
that S 0 (n-i)=C ni the (n-i)th Catalan-number). 


2004 by Chapman & Hall/CRC 


In This Way, But Nicely. Pattern Avoidance. Followup. 
Summing (5.13), (5.15) and (5.16) we get 


189 


S r (n) = f(n) + g(n) + 2 • ^ S r (i - 1) * C n -i. 


(5.17) 


Let F, G, C and S denote the ordinary generating functions of f(n), g(n), C n 
and S (n). Then the previous equation yields 


S r (x)=F(x)+G(x)+2x-C(x)S(x), 


that is, 



(5.18) 


Therefore S(x) is d-finite as it is the product of two d- finite power series. 
Indeed, the numerator is d-finite and 1/(1 — 2 xC{x)) = 1/Vl — 4x is d- 
finite. Thus S (n) is P-recursive and we are done with the first part of the 
proof. 

(b) Let q be any subsequence of length k. We must prove that the number 
C v (n) of w-permutations that end with a subsequence of type q and contain 
exacdy r subsequences of type 132 is a P-recursive function of n. Clearly, if 
q contains more than r 132-subsequences, then C q>r (n)= 0 and we are done. 
Otherwise we will do induction on k, the case of k= 1 being obvious. There 
are three different cases to consider. 

• If q has more than r inversions, then it is obvious that no such 
permutation can have its entry 1 on the left of the last k elements. 
Therefore, this entry 1 must be a part of the ^-subsequence formed by 
the last k elements. Now deleting this entry 1 we may or may not lose 
some 132-patterns as there may or may not be inversions on its right, 
but we can read off this information from q. (See the next example). 
Again, let q’hc the pattern obtained from q by deleting its entry 1. If 
we do not lose any 132-patterns by this deletion, then we are left with 
an (n-1)-permutation ending with the pattern q’ and having r 
subsequences of type 132. If we lose t such patterns, then we are left 
with an (rc-1)-permutation ending with q’ and having r-t such 
subsequences. Both cases give rise to a P-recursive function of n by our 
induction hypothesis as ^Ts shorter than q. 

Example 5.28 

If q= 3 41 6 5 2, then q =2 3 541 thus we lose three subsequences of type 
132 when deleting 1. Therefore, we can apply our inductive hypothesis for 
r-3, then reinsert the entry 1 to its place. If q= 3 1 2 4, then we don’t lose 
any 132-patterns when deleting the entry 1 and getting q’= 2 1 3. Thus we 
still need to count permutations with r 132-patterns, but they must end 
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with q, not with q. The pattern </is shorter than q, thus the induction 

hypothesis on k can be applied. 

• If q has at most r inversions, but q is not the monotonic pattern 1 2—k, 
then it can also happen that the entry 1 is not among the last k entries 
of our permutation. However, we claim that it cannot be too far away 
from them. Indeed, let y be an element from the last k elements of the 
permutation (thus one of the elements of the set L of the last k entries 
that form the ending q) which is smaller than some other element 
x <E L on its left. Then clearly, if n is large enough, then y must be 
smaller than r+k+ 1, otherwise we would have too many 132-patterns 
of the form w x y. So y is bounded. If the entry 1 of the permutation 
were more than 2r+k+\ to the left of y, then there would necessarily 
be more than r elements between 1 and y which are larger than y, a 
contradiction. Thus the distance between 1 and y is bounded. Therefore 
we can consider all possibilities for the position of the entry 1 of the 
permutation and for the subsequence on its right. In each case we can 
delete the entry 1 and reduce the enumeration to one with a smaller 
value of r, (as q has at least one inversion), then use the inductive 
hypothesis on r. Thus this case contributes a bounded number of P- 
recursive functions, too. 

• Finally, if q is the monotonic pattern 1 2---k, then use the complementing 
method of Lemma 5.25. 

Now note that speaking in terms of ordinary generating functions, all operations 
we made throughout the induction step were either adding or multiplying a finite 
number of power series together. In particular, the ordinary generating function 
C(x) of our initial C„-sequence (that is, when r= 0 and k=0) is C(x) = 1 ~^~ 4a: , 
thus an algebraic power series. Therefore, the ordinary generating function of 
S r (n), the power series S r (x), is algebraic, too. 

Now note a bit more precisely that throughout our proof we have either 
added formal power series together, or, as in (5.14), multiplied two functions of 
type Sj{irl) together, or, as in (5.18), multiplied a power series by the power series 
1/(1 — 2x ■ C(x)) = l/i/l — 4x■ Therefore, the following proposition is 

immediate: 


PROPOSITION 5.29 

Let r> 1, and recall that S r (x) denotes the ordinary generatingfunction of the numbers S r (n). 
ThenS r (x) € C[[x, fl — 4x]]. Moreover, when written in smallest terms, the denominator 
ofS(x) is a power off 1 — 4x. 
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It is convenient to work in this setting as the square of — 4x is an element 
of C [x], which makes computations much easier. 

We are going to determine the exponent f(r) that fl — 4x has in the 
denominator of G r (x). Equations (5.14) and (5.18) show that 

f(r)=max lsKr (f(i)+f(r-i))+l. (5.19) 

We now claim that f(r)=2r-\. It is easy to compute (see [33]) thaty(l)=l. Now 
suppose by induction that we know our claim for all positive integers smaller 
than r. Then (5.19) and the induction hypothesis yield that for some i, we have 

f( r ) = (f(i)Tf( r ~i)) + l=(2z-l)+(2r-2zT)+l=2r-l, 


which was to be proved._ 

Recall now that V's/T — 4x = J2 n >o ( 2 n') x ' n and that the sequence ( 2 ^) 
satisfies a linear recursion. Differentiate both sides of this equation several times. 
On the left-hand side, each differentiation will add two to the exponent of in 
i/l — 4x the denominator. On the right-hand-side, it will add one to the degree 
of the highest-degree polynomials appearing in the recursive formula for the 
coefficients. Thus, differentiating r-1 times we get that the denominator of Sfx) 
gives rise to a polynomial recursion of degree r. The numerator of S, (x) cannot 
increase this degree. 

We collect our observations in our last lemma: 

LEMMA 5.30 

Let r> 1 and let S r (x) be the generatingfundionfor the sequence ]5„ r (132) } n . Write Sfx) in 
lowest terms. Then the denominator of Sfx) is equal to 

(fl - 4x) 2r ~ 1 = (1 - Axf- 1 ■ vT - 4x. 

Therefore, the sequence ]5 ir (132) } n satifies a polynomial recursion with degree at most r. 


5.2 Containing a pattern many times 
5.2.1 Packing Densities 

In this section we will turn around and instead of trying to avoid a pattern, or 
containing a pattern just a few times, we will try to contain a given pattern q as 
many times as possible. More precisely, for fixed q and n, we want to find the 
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largest integer M n>ll so that there exists an w-permutation that contains exactly M n>(j 
copies of q. However, for divisibility or other reasons, it can happen that not all 
integers n behave the same way as far as containment of many copies of q is 
concerned. Therefore, a more global way of measuring how many copies of q can 
be packed into an w-permutation is desirable, and is provided by the following 
definition. 


DEFINITION 5.31 Let q be a pattern ofk elements. Then the packing density ofq, denoted 
g(q), is given by 


9(Q) = 


lim ^ 

n—> oo 


C) 


In other words, the packing density of q tells us how large a portion of all k- 
element subwords of an w-permutation can be copies of q as n goes to 
infinity. 

The alert reader probably caught us red-handed in committing the sin of using 
lim^oo fft to define packing density, without proving first that that limit exists. 

The limit does exist, however, as can be seen from the following result of Galvin, 
which appeared in [158]. In what follows, if an w-permutation^ contains M(n, q) 
copies of q, then we will say that p is a ^-optimal permutation. 

LEMMA 5.32 

Let q be a pattern of length k. Tlunfor n>k, t!w sequence 

Mn,q 

(!) 

is nonincreasing, therfore^ lm nsoo exists. 

PROOF Let g(n, q) = M n ^/(^). We show that 


g( n > q), 

which will clearly imply our claim. 

Letjbbe an w-permutation that is ^-optimal, that is, contains a maximal number 
of copies of q. Let p < i>, p< 2 >,-- -p<e> denote the n permutations of length n -1 that 
we obtain from p by omitting the first, second, and so on, entry of p. It is clear 
that the proportion g(n, q) of ^-copies among all A-element subwords of p is the 
average of the proportions prop(p, q, i) of ^copies among all A-element sub words 
of the p<i>. (The average is taken for all i G [«■].) Therefore, there has to be at 
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least one i that is not below average, that is, for which g(n, q)< prop(p, q, i)<gfi-l, 
q). This shows that our sequence is nonincreasing, and therefore, consisting of 
non-negative elements, convergent. | 

If q is a monotone pattern of length k, then the answer is trivial. Indeed, we 
have M U} q = (^), and this maximum is attained on the monotone permutation 

of length n. Therefore, the packing density of any monotone pattern is 1. Hence, 
for the remainder of this section, we will assume that q is not monotone (unless 
otherwise stated). 


5.2.2 Layered Patterns 

For a general pattern, however, the question of determining M„ ri is still unsolved. 
There is a special kind of pattern of which we know significantly more. These are 
the so called layered patterns. 


DEFINITION5.33 A pattern q is called layered if q can be written as the concatenation 
qi q 2 q m of the strings qi,for 1 <i<m, where 

(a) each q t is a decreasing sequence of consecutive integers, and 

(b) the leading entry of q, is smaller than tlw leading entry of q i+ i,for 1 . 

Example 5.34 

The patterns 1432765 and 4321576 are both layered. Q 

See Figure 5.2 for an illustration. 

Finding M m is much easier for layered patterns q than for general patterns. 
This is because of the following unpublished theorem of Stromquist. 

THEOREM 5.35 

Let qbe a layered pattern. Then there exists a layered permutation p so thatp has exactly 
copies of q. 

That is, among the w-permutations that are ^-optimal, there is at least one 
layered permutation. The number of layered permutations of length n is 
obviously 2”' 1 , which is much less than n\, so the above theorem is extremely 
useful if we want to find a ^-optimal permutation with the help of a computer 
program. 

We point out that while it is not true in general that all ^-optimal permutations 
are layered, there are special cases when they are. See Exercise 27 for a large 
special case. 
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\ 

FIGURE 5.2 

The diagram of a generic layered permutation. 


What can we say about the structure of a layered ^-optimal permutation when 
q is layered? Even this question is unsolved in this generality. However, there are 
certain precise answers in the case when q has only two layers. We will discuss the 
shortest of these patterns, the pattern q=132. For the more general case of 
q= \k(k-\)—32, see [37]. For a different, and also more general treatment, see 
[158]. 


5.2.2.1 The pattern 132 

We want to construct an w-permutation that contains 132 as many times as 
possible. Our intuition might suggest that (disregarding questions of divisibility) 
we split the set [n] into three parts, then place entries from 1 to n!3 into the first 
nJ3 positions, place entries from ^ + 1 to into the last nJ 3 positions, finally 
place entries from ^ + 1 to n into the middle nl?> positions. Then apply the same 
strategy recursively in each of the three tiers we have just created. This construction 
imitates the pattern 132 quite closely, so it seems reasonable to conjecture that it 
is optimal. Nevertheless, this conjecture is false. For instance, if n= 9, then this 
method constructs the permutation 132798465 that has 32 copies of 132 (27 
containing one entry from each tier, 3 within one tier, and 6 containing one entry 
from the first tier and two entries from one of the other tiers). On the other 
hand, the permutation 1 32 987654 contains 3 x (®) + 1 = 46 copies of 132, 
which is significantly more than 32. Therefore, we have to look for 132-optimal 
permutations with other methods. 

Let/) be a 132-optimal w-permutation that is layered. We know from Theorem 
5.35 that such a permutation exists. We will show that layered 132-optimal 
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permutations have a simple recursive structure. This is because of the fact, which 
we will use many times, that to form a 132 pattern in a layered permutation one 
must take a single element from some layer and a pair of elements from a 
subsequent layer. 

PROPOSITION 5.36 

Letp be a layered 132-optimal n-permutation whose last layer is of length m. Then the leftmost 
k=n-m elements of p form a 132-optimal k-permutation. 

PROOF Let D t be the number of 132-copies of p that are disjoint from the last 
layer. The number of 132-copies of p is clearly fc(™) + Df.- So once k is chosen, 

p will have the maximum number of copies if D h is maximal. (Note that this 
argument works even if m = 1 as Q) = 0.) | 

We point out that the proof of this proposition uses the fact that 132 has only 
two layers, the first of which is a singleton. 

As we are now concentrating on copies of 132, we simplify our notation by 
setting 32 . That is, M„ denotes the maximum number of 132copies an n- 

permutation can have. Then the previous proposition implies that 



(5.20) 


The integer k for which the right hand side attains its maximum will play a crucial 
role throughout this section and the next one. Therefore, we introduce specific 
notation for it. 

DEFINITION 5.37 For any positive integer n, let k n be the positive integerfor which 



is maximal. If there are several integers with this properly, then let k n be the largest among 
them. 

In other words, k„ is the largest possible length of the remaining permutation 
after removing the last layer of an optimal w-permutation p. When there is no 
danger of confusion, we will only write k instead of k n , to simplify notation. We 
will also always use m=n-k to denote the length of the last layer of p. 

We continue our search for 132-optimal w-permutations. The construction at 
the beginning of this section, that was shown not to be optimal, leads to n- 
permutations with 
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copies of 132. Our goal is to find a construction, for general n, that provides 
more copies. The proof of the following lemma provides such a construction. 

LEMMA 5.38 

We have 


9(132) = lim -f- > 0.464. 

n ~^oo 

PROOF Let us look for 132-optimal w-permutations in a specific form, which we 
will call geometric form. In these permutations, disregarding questions of divisibility, 
the last layer is of length m=t-n (for some /<1), the penultimate layer is of length 
(1 -i)tn, the one before that is of length {\-t)1ln, and so on. In other words, the 
layer lengths form a geometric progression with quotient 1 -t. 

How many 132-patterns will such a permutation^ (t) contain? It follows from 
the layered structure oip(t) that in a copy of 132 in p(t), the last two entries will 
come from the same layer. We will count the 132-copies according to the layers 
that contain their last two entries. 

There are (™) k —■ | ■ t 2 ( 1 — t)n 3 — 0(n 2 ) copies of 132 that have their last 
two entries on the last layer of p(t). Similarly, there are roughly i - £ 2 (1 — tfn 3 
copies of 132 that have their last two entries on the penultimate layer. There are 
roughly k •t 2 ( 1 — t) 7 n :i copies that have their last two entries on the layer before 
that, and so on. Therefore, if wis large enough, the n-permutationj!)(2) created this 
way will have 

y * 2 (1 - 1) 5^(1 - t) 3j - 0(n 2 logn) = 2 ” * - 0(n 2 logn) (521) 

copies of 132. Indeed, there are at most 0(log n) layers, and on each layer, the 
error term is at most 0(n 2 ). Note that for instance for £=2/3, (this is what we used 
in our counterexample above), the above formula shows that we have n 3 /13-0(t z 2 
log n) copies of 132, which is significantly more than the w 3 /24 achieved by our 
first construction. However, t= 2/3 is still not the best choice. To find the best 
choice for t, we simply use elementary calculus to find the maximum of the 
function 

f(A = _ *(1 ~ *) 

2(1 — (1 — t) 3 ) 2(t 2 — 3/ + 3) ’ 

We find that the maximum ofy Zon the interval [0, 1] is taken at t = 3 ~ 

0.634, so this is the best choice for t. 

Our construction works for all n, proving that 

M n > ———r n 3 - 0(n 2 logn). 

- 2 {t 2 — 31 + 3) v ; 
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It is then clear from the definition of g(132) that 


5(132) 


lim 


Mn 

6 ) 


> lim max —— 
n-vootg[o,l] 1/6 


0.464. 


(5.22) 


■ 

We claim that in the long run, nothing beats patterns in geometric form. In other 
words, the above construction is asymptotically optimal among all n permutations. 
By this we mean that the coefficient of ri' in M„ cannot be improved. This is the 
content of the following theorem. 


THEOREM 5.39 

[158] The above construction is asymptotically optimal, that is, 

g(132)=0.46 


PROOF Let 


B = 


3z(l — z) 

max -- - 

z€[0,l] 1 + Z + Z 2 


max h(z). 
*efo,i] 


(5.23) 


We will first prove that B=g{ 132), then we will prove that 


B = 6 max fit). 


which will complete the proof of our theorem by (5.22). To prove the first claim, 
it suffices to prove that 





(5.24) 


or, in other words, that for all n, we have 


ip 

Mn < B-. 

6 

We are going to prove this claim by induction on n. For re=l, the statement is true 
as the left-hand side is equal to 0. Now assume that we know the statement for all 
positive integers less than n. 

Recall that we have described the structure of 132-optimal permutations in 
Proposition 5.36, and in the subsequent formula 5.20. Therefore, we have 

M„ = 

= max I Mk + 
fc V 


„, , , m 

max I Mk + k 
k \ V 2 


(5.25) 


< max (Mf. + fem 2 ) 
k 


(5.26) 
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= max 
k 


( 




(5.27) 


= ( max (y (5(1 - tf + 3(1 - t)t 2 )^ . (5.28) 

In the last step, we dropped the requirement that tn=m and (1-/)«=/; be 
integers. 

Remember that B was defined as a maximum in (5.23). From that definition 
it follows that 


or, setting t=\-z, 


3z(l — z) 


2 — 


< B 


1 + z + z : 

30(1 - zf < B(l - z 3 ) 
Bz 3 + 30(1 - zf < B, 


B(l-t) 3 +3(l-t)f<B. 


Comparing this last equation with the last term of (5.28) proves (5.24). So we 
have shown that g(132 )=B. 

Therefore, our theorem will be proved if we show that B = 6 max (G [ 0 ,i] /(*)- 
This is immediate if we observe that A(z)=6/(l-z). | 

As this is our first result in the theory of packing densities, it is worth pointing 
out the interesting fact that while our problem is about integers, the best ratio of 
consecutive layer lengths in order to contain many copies of 132 was provided by 
an irrational number, the number 


5.3 Containing a pattern a given number of times 

In the last section, we looked for permutations that contained a given pattern 
as many times as possible. In this section, we will take a different approach and 
we will be looking for permutations that contain a given pattern a given number 
oftimes. 

DEFINITION 5.40 Let nbe afixed positive integer, and let q be afixed pattern. Let S n ,c(q) 
be the number ofnfermut adorn with exactly c patterns oftype q. The sequence \S mc (q) } cS0 is 
called the frequency sequence of t!w pattern qfor n. 

The alert and meticulous reader will point out that we have already seen an 
example for a frequency sequence, and we have proved several interesting 
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statements about it. Indeed, inversions, our main topic in Section 2.1, are nothing 
else but 21-pattems. We have seen that the frequency sequence of inversions is 
log-concave, and is therefore unimodal. 

When q is longer than 2, numerical evidence suggests that the frequency 
sequence of q will no longer be unimodal, let alone log-concave. In fact, disturbing 
objects called internal zeros seem to be present in most frequency sequences. 

DEFINITION 5.41 An integer c is called an internal zero of the sequence [S^fq) } cS $fwe 
have S n , c (q)= 0, but there exist c 1 and c^ so that 

(a) cfNf, and 

(b) S nrCl (q ) > 0, and S„, C2 (q) > 0- 

Example 5.42 

Let 5=123, and let h>3 be any positive integer. Then c = ( '*) — 1 is an internal 
zero of the sequence (S„fq)) cs , 0 as we can have c\ = f 1 ” 1 ) and C 2 = (") for the 
permutations wl2--w-l and 123--W, respectively. [J 

5.3.1 A Construction With a Given Number of Copies 

In this subsection we show that there are infinitely many integers n so that the 
sequence T’=>S’,„,(132) does not have internal zeros. We will call such an integer, 
or its corresponding sequence, NIf (no internal zero), and otherwise If. Our strategy 
is recursive; we will show that if k n is NIZ, so is n, where k n was described in 
Definition 5.37. As k<di, this will lead to an infinite sequence of NIZ integers. 
There is a problem, however. In order for this strategy to work, we must ensure 
that given k, there is an n such that k=k„. This is the purpose of the following 
theorem. 


THEOREM 5.43 


The sequence diverges to infinity and satisfies 


k <k^<k+l 

n n+1 n 


for all n> 1. So, in particular, for all positive integers k there is a positive integer n so that 

k n =k. 


The next subsection is devoted to a proof of this theorem. We suggest that 
the reader assume the result now and continue with this section to preserve 
continuity. Keep the notation M k from the previous section. Before starting the 
proof of our main theorem, we need only note the useful fact that 



(5.29) 
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which follows by considering the permutation l£(M)(A-2)---32. 

THEOREM 5.44 

There are infinitely many NTf integers. 


PROOF By Theorem 5.43, it suffices to show (for sufficiendy large £„) that if k„ 
is NIZ then so is n. By the same theorem, we can choose n so that k„ is NIZ, and 
£„>4. To simplify notation in what follows, we will write k for k n . 


Now given c with 0 < c < M n = Mk+kiff'), we will construct a permutation 
p £ S n having c=c(p) copies of 132. Because of (5.29) and 4, we have M t >k- 
1. So it is possible to write c (not necessarily uniquely) as c=ks+t with 0 < s < (™) 
and Q<t<M k . Since k is NIZ, there is a permutation p 1 £ Sk with c(p)=t. Also, it 
is not difficult to prove (see Exercise 15) that there is a permutation in S m with no 
copies of 132 and s copies of 21. Let p" be the result of adding k to every element 
of that permutation. Then, by construction, p — p'p" £ S n and c(p)=ks+t=c as 
desired. | 

One can modify the proof of the previous theorem to locate precisely where the 
internal zeros could be for an IZ sequence. We will need the fact (established by 
computer) that for 12 the only IZ integers are 6 , 8 , and 9, and that they all 
satisfied the following result. 

THEOREM 5.45 

For any positive integer n, the sequence F n does not have internal zeros, except possiblyfor c=M n - 
1 or c=M n - 2, but not both. 

PROOF We prove this theorem by induction on n. Numerical evidence shows 
that the statement is true for 12. Now suppose we know the statement for all 
integers smaller than n, and prove it for n. If n is NIZ, then we are done. 

If n is IZ then, by the proof of Theorem 5.44, k=k n is IZ. So £>6 and we have 
M k >k+ 2 by (5.29). Now take cwith 0< csM„-3 so that we can write c=ks+t with 
0 < s < ) and 0^i^M r 3. Since the portion of F h up to 5 * 432 (Af*-3) has no 

internal zeros by induction, we can use the same technique as in the previous 
theorem to construct a permutation p with c(p)=c. for c in the given range. 
Furthermore, this construction shows that if S^fM^-iJ^O for j=l or 2 then 
Sn,-t 32 (M n -i)¥=Q. This completes the proof. | 
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5.3.2 The sequence {A} 0 

In order to prove Theorem 5.43, we first need a lemma about the lengths of 
various parts of a 132-optimal permutation p. In all that follows, we use the 
notation 

b = length of the next-to-last layer of p 

a = length of the string created by removing the last two layers of p 
= n — m — b 

= k — b. 

The following lemma summarizes some innocent-looking properties of the above 
layer lengths that we will need in our proof of Theorem 5.43. As the reader will 
see, these properties are actually not all as immediate as they look. 

LEMMA 5.46 

We have t!wfollowing inequalities 

(i) b<m, 


(ii) a<(m-l)!2, 


(Hi) m>k which implies m>nl2 and A<?z/2, 


(iv) m<2(w+l)/3. 


PROOF The basic idea behind all four of the inequalities is as follows. Let p 'be 
the permutation obtained from our 132-optimal permutation p by replacing its 
last two layers with a last layer of length m ’ and a next-to-last layer of length b ’. 
Then in passing from p to p ’ wc lose some 132-pattems and gain some. Since p 
was optimal, the number lost must be at least as large as the number gained. And 
this inequality can be manipulated to give the one desired. 

For the details, the following chart gives the relevant information to describe 
p ’for each of the four inequalities. In the second case, the last two layers of p are 
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combined into one, so the value of b ’ is irrelevant. 


m' 

b' 

number of gained 132-patterns < number of lost 132-patterns 

b 

m 

O < M”) 

b + m 

0 

abm < 6(“) 

m + 1 

6-1 

(a + 6 — l)m < (™) + a(b — 1) 

m — 1 

6 + 1 

( m 2 " 1 ) + ab < (a + 6)(m — 1) 


Now (i) and (ii) follow immediately by cancelling bm from the inequalities in the 
first two rows of the table. From these two, it follows that a(b— 1) < So 
using the third line of the chart 


(k — 1 )m = (a + b — 1 )m < 


+ a(b- 1) <2 


m(m — 1) 


and cancelling m gives (iii). For (iv) we have 

( 2 ) “ ( 2 ^ + a6 < (a + b)(m — 1) = (n — m){m — 1). 

Canceling m- 1 and solving for m completes the proof. 

We now turn to the proof of Theorem 5.43. First note that, by Lemma 5.46 
(iv), we have 

, n — 2 

k = n — m > —-—. 

o 

So clearly diverges to infinity. For our next step, we prove that is 

monotonically weakly increasing. Let p n>1 denote an w-permutation whose last 
layer is of length n-i, and whose leftmost i elements form an optimal z-permutation, 
and let c n =c(p,J. Clearly 

(-‘n.i — 1 * i. T i 

PROPOSITION 5.47 

For all w>l, we have k n <k n+ i. 



PROOF As usual, let k=k„. Then it suffice s to s how that r:„ +1 j > c„ +L , for all i<Ck. 
This is equivalent to showing that 


Mk + k 


n + 1 — k 
2 


> Mi + i 


fn + 1 

V 2 



( 5 . 30 ) 
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However, by definition of k, we know that for all K.k, 

., , In — k\ . . (n — i\ 

M k + k f 2 + 2 J. (5.31) 

Subtracting (5.31) from (5.30), we are reduced to proving the inequality k(n- 
k)>i(n-i). Rearranging terms in order to cancel k-i gives the equivalent inequality 
n>k+i. However, £<w/2 by Lemma 5.46 (iii), and so the bound on i gives 
k+KZk^n. | 

The proof of the upper bound on k„ +l is a bit more involved but follows the same 
general lines as the previous demonstration. Note that this will finish the proof of 
Theorem 5.43. 


LEMMA 5.48 

For all positive integers n, we have k^k^^kNl. 

PROOF We are going to use induction on n. The statement is easy to check for 
2. Suppose we know that the lemma is true for integers smaller than or equal 
to n, and prove it for n+ 1. For simplicity, set k=k„, m=n-k, and ci=c n+u . Since we 
have already proved the lower bound, it suffices to show that 


Ci > Ci+i for k + 1 < i < 


n + 1 
2 


(5.32) 


Note that we do not have to consider j=[(n+l)/2] because of Lemma 5.46 (iii). 
We prove (5.32) by induction on i. For the base case, i=h+ 1, we wish to show 

m — H 


Mk+i + (k + 1) ( j > Mk- j-2 + (k + 2) 


(5.33) 


However, since p n j : is optimal by assumption, we have 


m 


Mk + k ( J > Mk+i + (fc + 1) 


m — 1 
2 


(5.34) 


Subtracting (5.34) from (5.33) and rearranging terms, it suffices to prove 

m-l>(M t+2 -MJ-(M k+1 -M t ). (5.35) 

Let p' G Sk, p" 6 Sk+i, and p'" G Sk+ 2 be layered 132-optimal permutations 
having last layer lengths w!, m", and m!", respectively, as short as possible. Since 
2 and ^<?z/2, we have k+2 and so, by induction, these three permutations 
satisfy the lemma. If m"=m'+l then let xbe the largest element in the last layer of 
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p" (namely x=k+l). Otherwise, rrt'=m' and removing the last layer of both p' and 
p" leaves permutations in and S k . m '+u respectively. We can iterate this process 
until we find the single layer where p’and p" have different lengths (those lengths 
must differ by 1) and let x be the largest element in that layer of p". Similarly we 
can find the element y which is largest in the unique layer were p" and p"' have 
different lengths. 

Now let 

z=the number of 132-pattems inp"' containing neither x nor y, 
j=the number of 132-patterns inp"' containing x but noty, 

^=the number of 132-patterns in f" containing y but not x, and 
zf= the number of 132-patterns in.pt" containing both xandjy. 

Note that there is a bijection between the 132-patterns of p'" not containing y and 
the 132-patterns of p'. A similar statement holds for p" and p'. So 

M h =r, M k+1 =r+s, M k+2 = r+s+t+u. 

Note also that s>l because increasing the length of the layer of x results in the 
most number of 132-patterns being added to p’. It follows that 

(M i+2 M M )-(M k+1 -M k ) =t+u-s<u. 

However, there are only k elements of p’" other than x and y, so u<k<m -1 by 
Lemma 5.46 (iii). This completes the proof of (5.35) and of the base case for the 
induction on i. 

The proof of the induction step is similar. Assume that (5.32) is true for i-\ so 
that 

Mi-i + {i - 1) ^ 2 1 > M t + * f' {5.36) 

where l=n+l-i. We wish to prove 

Mi + i Q > M i+1 + (i + 1) ^ “ X ) . (5.37) 

Subtracting as usual and simplifying, we need to show 

Proceeding exactly as in the base case, we will be done if we can show that 2l-i- 
1 >zT or equivalendy />z. However, this is straightforward because /=zz+l-zand 
* < L(n+ 1)/2J. 

We have seen that there are infinitely many NIZ integers. It is natural to ask 
whether there are infinitely many IZ integers as well. The answer is in the 
affirmative, and can be found in [37]. 
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Exercises 

1. Find an explicit formula for 5l32,l (n). 

2. Prove that the number of ways to dissect a convex (w+l)-gon into n- 2 
parts with noncrossing diagonals is £ 132,1 (n). 

3. +Generalizing the previous problem, find a formula for the number/]??, 
d) of ways to dissect a convex w+ 2 -gon by d nonintersecting diagonals. 

4. Let/] and g„ be two sequences that differ only in a finite number of terms. 
Prove ihat/„ is Precursive if and only if g n is Precursive. 

5. Prove that for any fixed k , the sequence {S(n, k)} n is Precursive. 

6 . Prove that for any fixed k, the sequence {P(n, k) }„ is Precursive. See 
Exercise 33 of Chapter 1 for the definition of P(n, k). 

7. Let A(x) and B(x) be two algebraic power series. Prove that A(x)B(x) and 
A(x)+B(x) are also algebraic power series. 

8 . Let p k (n) be the number of partitions of the integer n into at most k parts. 
Is p k (n) a Precursive sequence? 

9. Prove that f(x)=P\n x is d-finite, but not algebracic. 

10. Prove that 5’,,(3142, 4231) is a Precursive sequence. 

11. Prove that >.S’„(1342, 2431) is a Precursive sequence. 

12. Prove that the sequence f(n)=(n!) n is not Precursive. 

13. Characterize layered permutations by pattern avoidance. 

14. Find the packing density of the pattern q k = \{k + 1) k—2. 

15. Prove that for any nonnegative integer s with $ S ( 2 ) there is a permutation 
p £ S n having s copies of the pattern 21 and no copies of 132. 

16. Let TV be a positive integer. Show that there exists a pattern q and a 
positive integer n so that the frequency sequence (S M (c)) &a contains TV 
consecutive internal zeros. 

17. Let p be an 321-avoiding //-permutation. At most how many inversions 
can p contain? 
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18. Let p be a k- -321 -avoiding n-permutation. At most how many copies of 
q={k- 1)•••321 can p contain? 

19. A k-superpattern is a permutation that contains all k\ patterns of length k. 
Let sp(k) be the length of the shortest A-superpattem. For instance, sp{ 2)=3, 
as 132 is a 2-superpattem of length three, and obviously, there is no 
shorter 2 -superpattem. 

(a) Determine sp( 3). 

(b) Prove that sp( 4) <10. 

(c) Prove that sp(k)<P. 

20. Prove that sp(k)<(k- 1) 2 +1. 

21. (a) Prove that sp( 4) > 6 . 

(b) Prove that sp{f)~>7. 

22. (a) Prove that if k is large enough, then sp(k ) > ^ . 

(b) Prove that for all integers k> 4, we have sp(k)>2k. 

23. Let p € »S'g. Then p has 2 8 =256 subwords. Prove that there cannot be 
more than 127 different subwords among them. 

24. Find 

25. Let r/ be a layered pattern consisting of k layers of length 2 each. Is it true 
that the ^-optimal layered w-permutation consists of k layers of length n/k 
each? (We can assume that n is divisible by k.) 

26. Find all 1243-optimal n-permutations. 

27. +Let <7 be a layered pattern in which all layer lengths are at least two. 
Prove that all ^-optimal permutations are layered. 

28. Prove that for any fixed k, the sequence G(n, k) of the numbers of n- 
permutations with k alternating mns is Precursive. 

29. Let M„ be the number of lattice paths from (0, 0) to ( n, 0) using steps (1, 
0), (1,1), and (1, -1) that never go below the liney= 0 . (These lattice paths 
are called Motzkin paths. Prove that M„ is a Precursive sequence. What 
permutations are enumerated by these numbers? 

30. Let r n be number lattice paths from (0, 0) to (n, n) using steps (1,1), (1, 0), 
and (0, 1) that never go above the diagonal x=y. Prove that r„ is a P 
recursive sequence. What permutations are enumerated by these 
numbers? 
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Problems Plus 

1. Find a formula for $123,1 (n), and prove from that formula that $123,1 (n) is P- 

recursive. 


2 . 

3. 


Prove that 

Sl 32 , 2 (n) = 

Prove that 


2n — 6\ n 3 + 17n 2 — 80n + 80 
n — 4 ) 2 n(n — 1) 


_ 2n \ 59n 2 + 117n + 100 

$123,2 («) 2n(2n — l)(n + 5) ’ 


4. Let n be a partition of the set [n]. We say that the 4-tuple of elements (a, 
b, c, d) is a crossing of n if a<ZKc<d, and a and c are in a block K\ of n, and 
b and d are in a different block n 2 of Jt. Let r be a fixed natural number, 
and let H r (n) be the number of partitions of [n]w ith exactly r crossings. 
Prove that H r (n) is a P-recursive function of n. If possible, strengthen this 
claim. 


5. 


6 . 


Let p be an w-permutation, and let f(p) be the number of all distinct 
patterns contained inj^. For instance, if/>=1324, then f(p)=l for the patterns 
1, 12, 21, 213, 123, 132, 1324. Let pat(n) = max pe s n f(p). Prove that 

pat(n) 

lint -—— > 1. 

n— foo 1.61" “ 

Keep the notations of the previous Problem Plus, and assume for simplicity 
that n=k 2 . Prove that (n) > 2( fc-1 ) 2 


7. Find a 4-superpattern of length nine. 

8. Prove that there exists a Lsupcrpattcrn of length | k 2 . 

9. We call the permutation p a weak /{-superpattern if for all patterns q of 
length k, at least one of q and q-' is contained in p. Prove that there exists 
a weak A-superpattern of length 

10. Prove that the ordinary generating function of the sequence $„(1234) is 
not algebraic. 

11. Can one pattern be equivalent to infinitely many? That is, do there exist 
patterns q and q u q 2 , ••• so that S n (q)=S n (q 1 , q 2 , ■■•)? In order to exclude 
trivial answers, we require that the infinite sequence qi, q 2 , ••• consist of 
patterns of pairwise distinct sizes. 
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12. Let q be a layered pattern that contains exactly two layers, and each of 
those layers are of length at least two. Prove that ^-optimal permutations 
are all layered, and also have exactly two layers. 


Solutions to Problems Plus 

1. This result is due toJ.Noonan, who proved that 

/<») = Wn) = ^( n 2 " 3 ). 

The P-recursive property then follows as 

f(n) n— 1 (2n)! (n + 2)!(n —4)! 2n(2n — 1) 

fin — 1) n (n + 3)!(n —3)! (2n — 2)! (n + 3)(n —3) 

2. This formula was first conjectured in [153]. It was proved by Mansour 
and Vainshtein [148], who found a general method to compute the ordinary 
generating function of the numbers S w fn), for 6. 

3. This formula was also conjectured in [153]. It was proved by Fulmek 
[99], who used a lattice path approach. 

4. This result is due to the present author and can be found in [39]. The 
proof is very similar to the proof of Si 3 2 , r ( n ) being polynomially recursive. 
The generating function is algebraic, and has only one quadratic irrationality, 
namely f \ — Ax. 

5. Let p(n) be the w-permutation 1 n 2 n-l ■■■, with , ^>(0)= 0,^b(l)=l, j&(2)=12, 
/>(3)=132, ^(4)=1423, p( 5)=15243, and so on. Then the value of f(p(n)) 
for these permutations is 1, 2, 3, 5, 8, 13. This suggests that f[p{n+ 2))> 
f(p(n+l))+f(p(n)). 

We are now going to prove this claim. Note that all patterns contained in 
p(n+ 2) start either in their minimal or in their maximal entries. 

(a) The patterns that start in their maximal entries. These patterns are 
clearly contained in p(n+2) even if the initial entry 1 of p(n+ 2) is 
omitted. Then the entry n+ 2 can play the role of the maximal entries 
of these patterns, showing that there are pat(p(n)) such patterns. 
(Removing 1 and n+ 2 from/i(w+2), we get p(n).) 

(b) The patterns that start in their minimal entry. The entry 1 of^(w+2) 
can play the role of the minimal entry for these patterns. Then these 
patterns can continue p(n+l) different ways. Indeed, removing 1 from 
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p(n+2 ), we get a permutation with f(p{n+1 )) different patterns as the 
obtained permutation is the complement of p(n+l). 

Therefore, the sequence/^ (n)) is at least as large as the Fibonacci sequence, 
proving our claim. This construction is due to Herb Wilf. 

6 . This result is due to M.Coleman [61]. The proof is as follows. Let pk be 
the w-permutation 

p k =m. .(/F-l ).1 ( £+1)... (F-k+l) 

For example, 

p=3 6 9 2 5 8 1 4 7. 

In other words, p k consists of k segments, each of which consist of k 
entries that are congruent to one another modulo k. 

We claim that p k has at least ^ different patterns. Indeed, let P k be 
the set of subsequences of p h that 

(a) Contain all k entries divisible by k, and 

(b) Contain all entries of [k]. 

In other words, subsequences in P k must contain the whole first segment, 
and the first element of each segment. This means that these patterns 
must contain these 2k-l entries, and are free to contain or not to contain 
the remaining (M) 2 entries. Therefore, P k consists of 2( fe-1 ) 2 subsequences. 
The reader is invited to verify that these subsequences are all different as 
patterns. 

7. We claim that 519472683 is a 4-superpattern. To verify this, note that the 
last five entries form a 3-superpattem. The entries 1 and 9 are on the left 
of that 3-superpattem, so we certainly have all patterns that start with 1 
or 4. One then verifies that the remaining 12 patterns are also present. 
This construction is due to Rebecca Smith. Note that computer data proves 
that there is no 4-superpattern of length 8. 

8 . This result was proved in [85]. The authors proved the upper bound 

constmctively as follows. Let us consider an chess board, in which, 
like in real chess, the bottom right corner square is white. Let us pick a 
subset S of / squares on this chess board. We will associate a /-permutation 
p(S) to S as follows. From left to right, write the numbers 1 through n in 
the first (bottom) row in increasing order, the numbers n+1 , 2 n, in the 

second row in increasing order, and so on. Then read the entries that 
belong to S column by column, starting with the leftmost column, and 
going down in each column. See Figure 5.3 for an example. 
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FIGURE 5.3 

The chosen squares lead to the subword 48526, then to p(S)=252>\^. 


This will result in a pattern of length t, which then defines a /-permutation 
by relabeling. 

The authors then take the kx [3/2/2] chess board, and choose S to be the 
set of all white squares. They prove that the resulting permutation is in 
fact a Lsuperpattern. This result is not tight. Indeed, for k= 3, the white 
squares of a kxk chess board are sufficient. In fact, it is conjectured in [85] 
that there exists a A-superpattern of length |fe 2 + o{k 2 ). 

9. This result was proved in [85]. Let S=T h be the set of squares weakly 
below the diagonal of the kxk rectangular chess board. It is then proved 
that the permutation p(k) defined by these squares (in the sense of the 
solution of the previous Problem Plus) contains q or q l for each pattern 
q of length k. 

10. The following argument is due to M.Bousquet-Melou (unpublished). Recall 
that Theorem 4.11 says that 

(k — l) 2n 

S„(1234 ■ ■ - k) ~ 2_ 2 fc)/2 ’ 

where A f is a constant given by a multiple integral. In particular, if £>2 is 
even, then the denominator of the right hand side is a polynomial of the 
form rf, where n is a positive integer. However, it can be proved (see for 
instance [86], Theorem D, page 293) that the coefficients of an algebraic 
power series cannot be of this form. So the generating function of the 
sequence iS'„(1234---A) is not algebraic if k is an even number that is larger 
than 2. 

11. Yes. It is proved in [12] that 5„(1342)=5’„(^ 2 , <?3, ■■■)> where 

q= 2 2m-l 4 16 3--2 m 2m-3, 

for 2. 

12. This result was proved in two parts. First, it was shown in [158] that if p 
is a layered ^-optimal permutation, then p has only two layers. The somewhat 
lengthy argument shows that if there exists a layered n-permutation with 
j+1 layers containing k copies of q, then a layered w-permutation containing 
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more than k copies but consisting of only s layers also exists, as long as 
2.) Then, it was shown in [3] that if ry is as specified in the exercise, then 
all ^-optimal permutations are layered. Note that the latter result does not 
need the requirement that q has only two layers, just that all layers of q 
are of length at least two. 
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6.1 The Probabilistic Viewpoint 

In the previous chapters we have enumerated permutations according to various 
statistics. A similar line of research is to choose an w-permutation p at random, and 
compute the probability of the event that p has a given property A. Throughout 
this chapter, when we say that we select an n-permutation at random, we mean 
that each of the n\ permutations of length r are chosen with probability 1 Ini. 

Theoretically speaking, this is not a totally new approach. Indeed, the probability 
of success (that is, the event that p has property A) is defined as the number of 
favorable outcomes of our random choice divided by the number of all outcomes. 
In other words, this is the number of w-permutations having property A divided 
by the number of all w-permutations, which is, of course, n\. Therefore, the task 
of computing the probability that p has property A is reduced to the task of 
enumerating w-permutations that have property p. 

In the practice, however, methods borrowed from probability theory are often 
extremely useful for the enumerative combinatorialist. In this chapter, we will 
review some of these methods. 

In the most direct examples, no specific knowledge of Probability Theory is 
needed. We just need to look at our object from a probabilistic angle. Let us start 
with some classic examples. 


Example 6.1 

Let i and j be two distinct elements of [n], and let p be a randomly chosen n- 
permutation. Then the probability that i and j belong to the same cycle of p is 
1 / 2 . □ 


PROOF For obvious reasons, the probability in question does not depend on 
the values of i and j, so we might as well assume that v=n-1 and j=n. 

The crucial idea is that we can use the Transition Lemma (Lemma 3.39). 
Indeed, entries n -1 and n are in the same cycle of p if, when p is written in 
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canonical cycle notation and the parentheses are omitted, n precedes n- 1. This 
obviously happens in half of all w-permutations, proving our claim. | 


This result can be illustrated as follows. I go to a movie theater where each 
customer is given a ticket to a specific seat. Careless moviegoers, however, do not 
respect the assigned seating, and just take a seat at random. When I go to my 
seat, it can be free, or it can be taken. If it is free, fine. If somebody else has 
already taken it, I ask her to go to her own seat. She does that, and if that seat is 
free, then everybody is happy; if not, then she will ask the person illegally taking 
her seat to go to his or her assigned seat. This procedure continues until the 
person just chased from his illegally taken seat finds his assigned seat empty. 
Then the probability that a randomly selected person will have to move during 
the procedure (that is, that he is in my cycle) is one half. 

The previous example discussed a “yes-or-no” event, that is, two entries were 
either contained in the same cycle, or they were not. Our next observation is 
more refined. 


Example 6.2 

Let i,k 6 [n], and let p be a randomly selected w-permutation. Then the 
probability that the entry i is part of a Ccyclc of p is l In; in particular, it is 
independent of k. G 

This fact is a bit surprising. After all, one could think that it is easier to be 
contained in a large cycle than in a small cycle as there is “more space” in a large 
cycle. 

PROOF We would like to use the Transition Lemma again. For obvious reasons 
related to symmetry, the choice of the entry i is insignificant, so we might as well 
assume that i=n. The bijccrion f of the Transition Lemma maps the set of 
w-permutations in which n is part of an T-cycle into the set of w-permutations in 
which f)„ + i_i=n. The latter is clearly a set of size (n- 1)!, and our claim is proved. | 


In the context of the explanation following the previous example, this means that 
it is just as likely that i people in the movie theater have to move as it is that j 
people have to move, no matter what i and j are. 


6.1.1 Standard Young Tableaux 

Our simple examples in the previous subsection might have given the impression 
that a proof using elementary probability arguments is necessarily simple. This is 
far from the truth. We illustrate this by presenting two classic proofs given by 
Greene, Nijenhaus and Wilf [115]. 
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FIGURE 6.1 

A Standard Ymng Tableau on ten boxes. 


A Standard Young Tableau is a Ferrers shape on n boxes in which each box 
contains one of the elements of [n] so that all boxes contain different numbers, 
and the rows and columns increase going down and going to the right. 
Standard Young Tableaux have been around for more than one hundred 
years by now, being first defined by the Reverend Young in a series of 
papers starting with [204] at the beginning of the twentieth century. Figure 
6.1 shows an example. 

Given this definidon, a real enumerative combinatorialist will certainly not 
waste any time before asking how many Standard Young Tableaux exist on a 
given Ferrers shape, or more generally, on all Ferrers shapes consisting of n 
boxes. Fortunately, we can answer both of these questions, which is remarkable 
as we could not tell how many Ferrers shapes (that is, partitions of the integer 
n) exist on n boxes. Standard Young Tableaux are very closely linked to 
permutations. On one hand, the entries of a SYT determine a certain 
(restricted) permutation of [n]. On the other hand, as we will see in the next 
chapter, there are beautiful connections between SYT on n boxes, and the set 
of n\ permutations of length n. All the above motivate us to discuss SYT in 
this book. 


6.1.1.1 The Hooklength Formula 

DEFINITION6.3LetFbe a Ferrers shape, and letb be a box ofF. Then thehook. ofb 
is the setH b of boxes in Fthat are weakly on the right ofb (but in t!u same row) or weakly below 
b (but in t!u same column). The size ofH b is called the hooklength ofb, and is denoted by h b . 
Finally, the box b is called the peak of H b . 


See Figure 6.2 for the hook H b of b and the hooklengths associated to each box 
of a Ferrers shape. 

The number of SYT of a given shape is given by the following classic 
theorem. 


THEOREM 6.4 Hooklength Formula 

Let F be any Ferrers shape on n boxes. Tlun the number of Standard Yung 
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FIGURE 6.2 

A hook and all the hooklengths of a Ferrers shape. 
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FIGURE 6.3 

The five SYT of shape F= 2x3 and the hooklengths of F. 
Tableaux of shape F is equal to 

nl 

n 6 h ’ 

where the product is over all n boxes b fF. 


( 6 . 1 ) 


Example 6.5 

Let F be a 2x3 rectangle. Then there are five Standard Young tableaux of 
shape F, and the hooklengths of Tare, row by row, 4, 3, 2, 3, 2, 1. So the 
Hooklength formula is verified as 4.3.2*3. 2 I = iff = See Figure 6.3 for an 
illustration. 0 


This is one of those theorems that looks unbelievable at first sight, then believable, 
and then unbelievable again until a proof is completed. Indeed, at first sight it is 
not even obvious that the right-hand side of (6.1) is an integer. 

On a second thought, we could argue like this. Let us write in the entries of [n] 
into Tin some random order. The obtained tableau will be a Standard Young 
Tableau if and only if each hook has its largest entry in its peak. The probability 
of that happening for a given hook H b is T, so f we can multiply these probabilities 
together, we are done. 

Unfortunately, this is too big of a leap in the general case. Let A b be the event 
that H b has its largest entry at its peak. While it is true that T[Aj] = it is in 
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FIGURE 6.4 

Events A b and A, are not independent. 


general not true that P[At, fl A c ] = P[Aj,] • P[A C ]. In other words, the events A b 
and A c are not independent’, the occurrence of one can influence the occurrence of 
the other. To see this, consider the Ferrers shape shown in Figure 6.4, with the 
boxes b and c marked. In the unlikely case that the reader has not seen an 
introductory probability text on independent events, it is very easy to catch up 
by consulting such a text. As we only need discrete probability in this book, we 
recommend [112]. 

Clearly, we have P[At,\ = P[A. C \ = I, but P[A(, fl A c ] = | as that is the 
chance that the element x in the bottom right corner is the smallest of the three 
elements involved. This kind of dependence of the events A b will always be 
present, unless Titself is a hook. Therefore, in the general case, this argument 
will not work. And this is why our theorem looks so unbelievable again. What 
we have to prove is that when everything is taken into account, we do have 

P[A b eFA b ] = Fife TTk after all. 


PROOF We will define an algorithm that generates a random SYT on F. We will 
show that each Standard Young Tableau T has 

P[T] = ( 6 .2) 

chance to be generated by our algorithm. As our algorithm always stops by 
producing a SYT, these probabilities have to sum to 1, meaning that there have 
to be Yl^hh °f them, proving that this is the number of SYT on F. 

The algorithm proceeds by first placing the entry n somewhere in F, then 
placing the entry n -1 somewhere else in F, and so on, until all entries have been 
placed. 

The entry n is placed as follows. First, a box in A denoted by q is chosen at 
random. (Here, and throughout this proof, all eligible boxes have the same 
chance to be chosen.) Then, q is moved to any of the other A ? -l positions of the 
original hook H r Call this new box q now. Then repeat the same for the new q, 
that is, move it to a different position within the new H q , and call that box q. 
Continue this until q becomes an inner comer, that is, a box whose hook consists of 
one box only, namely q itself. When that happens, place the entry n into q. 
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FIGURE 6.5 

Random choices leading to the placement of n. 


Example 6.6 

Figure 6.5 shows a possible hook walk to the placement of n=\ 0 in our 
Ferrers shape. The first choice has 1/10 of probability, the second one has 1/5, 
and the third one has probability 1/3. Therefore, the probability that this particular 
hook walk will be chosen when we place n is pC [] 


Once n has been placed, temporarily remove the box containing n from F to get 
the Ferrers shape F h and repeat the algorithm with n -1 playing the role of n. 
Continue this until all elements of [n] are placed and a SYT is obtained. It is 
obvious that this algorithm always produces a SYT. 

We promised to prove that all SYT on Tare obtained with the same probability 
by this algorithm. We prove this statement by induction on n, the base case of 
«=1 being trivially true. 

Let T be a SYT having shape F that contains the entry n in box q. Removing 
that box, we get the Standard Young Tableau 1\ that has shape F } . It is then clear 
by the structure of our algorithm that 

P[T]=I\T^p[q], (6.3) 

where P[q] is the probability that n gets placed into box q, while P[F] and 1\ 7|] 
are the probabilities that the mentioned tableaux are obtained by our algorithm. 
As 7i is a SYT on n -1 boxes, by the induction hypothesis, we have 

Plri1 = (64) 


Comparing equations (6.3) and (6.4), we see that our main claim (6.2) will be 
proved if we can show that 


P[q\ 


1 

n 


n 

cGL, 



(6.5) 
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FIGURE 6.6 

The projections / and J. 

where L q is the set of boxes that are in the same line (row or column) as q, but are 
different from q. Indeed, if d (f L q , then the removal of q does not affect the 
hooklength h d , so h d will be part of both the numerator and denominator, and will 
therefore cancel. Otherwise, the removal of q will decrease h d by one, explaining 
the remaining terms of the last equation. 

In order to better understand what P[q] is, we decompose it as a sum of 
several summands corresponding to the various sequences leading to q. If we 
have a sequence of random choices leading to the placement of n into q=(x, y), 
then we denote by /the set of rows (not including x) and by J the set of columns 
(not including y) that contained the moving box q at some point during the 
sequence. Then /and J are called the horizontal and vertical projection of that sequence. 

Let Pij[x,y] be the probability that a random sequence of choices leads to 
q=(x,y) and has horizontal projection /and vertical projection J. 


Example 6 .7 

Let /’be as shown in Figure 6.6, and let q=(2, 3). If /={1} and J={2), then 
there are two possible sequences ending in q, one is (1, 2) —> (1, 3) —> (2, 3), and 
the other is (1, 2) —> (2, 2) —> (2, 3). We have seen in Example 6.6 that the first 
sequence has p=Q probability to be selected, whereas the second one has probability 
To ' \ \ = TUo t0 t )C selected. This yields 


Pi,A x >y] 


i i _ i 
150 + 100 _ 60 


□ 


It is now obvious that 


I,J 


( 6 . 6 ) 


It is time that we proved (6.5). Note that we can rearrange (6.5) as follows. 

™-in(^)=U.(4)..n .K)' 


CGi'qr 


Ki<x —1 




(6.7) 
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where we split the set L q into two parts, those boxes that are in the same row as 
q and those boxes that are in the same column as q. So a, is the hooklength of the 
box (i,y) decreased by one, and b { is the hooklength of the box (x,j) decreased 
by one. 

We are going to prove (6.7) by showing that each expansion term of the last 
expression is equal to a suitably selected Pij[x,y], Then (6.7) will immediately 
follow by (6.6). 

In fact, we claim that the following holds. 


LEMMA 6.8 

Let x artdy be the coordinates of an inner corner q, and let I C [x — 1] and J C [y — 1]. 
Then we have 


Pij[x,y\ 


1 

n 



iei jeJ 


1 

bf 


Example 6.9 

Let F be the same shape as in Example 6.7, let x=2, and y=3, and let /=( 1}, and 
J={ 2}. Then we need to compute the product py • yy y;. As we said after equation 
(6.7), «i is the hooklength of the box (1, 3) decreased by one, that is, a x = 4-1=3. 
Similarly, b 2 is the hooklength of the box (2, 2) decreased by one, that is, b 2 = 3- 
1=2. This yields | ' | = <j5y This is what we expected as in Example 6.7 we 
have computed that P,^j[x, y] = F- Q 

PROOF (of Lemma 6.8) In this proof, it will be advantageous to denote a box 
by its coordinates (a, b) rather than just by one letter, and consequendy, hooks 
and hooklengths will also be denoted by H al and h„, r 

Assume first that J is empty. That means that the hook walk leading to q 
started in row y, and therefore its horizontal projection / completely determines 
it. So in this case, Pij[x,y] is equal to the probability of that single hook walk, and 
that is obviously 1 [], e/ f. We argue analogously if /is empty. 

Assume now that both / and J are nonempty. In this case we prove our 
Lemma by induction on |/]+|Jj. The initial case is when /={z} and J=\j], and this 
case is easy to verify. 

To prove the inductive step, note that if a hook walk has /={/, ^ i t \ as its 
horizontal projection, and "'i/s} for its vert ical projection then it must 

start at (i\jf Then it must continue either to (4,y"i) or to (i\,j 2 )- The hook walks 
from either of these two points to (x, y) have a truncated horizontal or vertical 
projection, that is, /—/-{/} or }, therefore the induction hypothesis applies. 

If A is the hooklength of the hook H of (z), f), then it is immediate from the 
definitions that 
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FIGURE 6.7 

The gray hooks intersect in one box. 


PlA x M 


1 

ft - 1 


(Pi>,j[x,y] + Pj,j<[x,y ]) 


but by the induction hypothesis this implies 


Pij[x,y] = 


1 


1 


h ~ l « yUieJ^Ujtjbj 


+ 


A 


rite/ a i n ieJ 


_ &i 1 + bj x 1 

f* - 1 n n i€ j ai ■ n,ej bj 

Finally, note that the first fraction of the last row is equal to one. Indeed, by 
the definition of the «, and ty, we have 


T] T bj. — ft*i,y 1 T bxyji 1 — ft 1; 

as | Hi lt y U H x j j | = |ftf| = ft, but the left hand side has an intersection of size 
one, the box q (see Figure 6.7). So the lemma is proved. 

I 


This proves (6.7) and therefore the equivalent (6.5), which in turn implies our 
main claim (6.2). | 


6.1.1.2 The Frobenius formula 

The other spectacular result on the enumeration of Standard Young Tableaux is 
the Frobenius formula, which also dates back to the beginning of the twentieth 
century [96], [97]. 


THEOREM 6.10 

[The Frobeniusformula] For a Ferrers shape F, letf F denote the number of Standard Young 
Tableaux that have shape F. Then for any positive integer n, we have 


E (f F ) 2 = A 


( 6 . 8 ) 
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FIGURE 6.8 

The Ferrers shapes on five boxes and their/Walue. 

where the sum on the left-hand side is taken over all Ferrers shapes on n boxes. 

Example 6.11 

Let w=4. Then there are five SYT on n boxes, and the values of / 7 ' for these 
SYT are 1, 1, 2, 3, 3 as shown in Figure 6.8. So the Frobenius formula is verified 
as l 2 +l 2 +2 2 +3 2 +3 2 =24. □ 

The Frobenius formula is even more surprising than the hooklength formula. It 
has a nice and simple bijective proof, which we will cover in Section 7.1. It also has 
various algebraic proofs. Flere we will show a probabilistic proof that is again due 
to Greene, Nijenhaus and Wilf [116]. 

The oudine of the proof will be as follows. We will define a random procedure 
that produces a random SYT on n boxes. Note that we do not know in advance 
what the shape of the obtained SYT will be. Then we prove that our procedure 
produces any given SYT of shape F with probability Therefore, the total 
probability that we get a tableaux of shape F is f T . Th = O ) , proving that 



F 


as claimed. 

We will need the notion of conditional probabilities. 


DEFINITION 6.12 Let A and B be two events so that P[B]> 0. Then we define 
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and we call P[A \B] the probability of A given B. 


In other words, P[A \B]\s the probability of the occurrence of A if we assume that 
B occurs. As we mentioned before, A and B are called independent A P[A \B]=P[A], 
that is, if the occurrence of B does not make the occurrence of A any more or less 
likely 

One basic and well-known application of conditional probabilities is Bayes’ 
Theorem, which is also called the law of total probability. It can be found in any 
introductory probability textbook, such as [112]. It states the following. 

THEOREM 6.13 

Let A C Ll be an event, and let X u X 2 , X m be events in fl so that the X, are pairwise 
disjoint, and X\ U X 2 LJ • ■ ■ U X m = fl. Then we have 


m 

P(A) = '£P(A\X i )P(Xj. 

i =1 

In the proof of the hooklength formula, we considered various probabilities 
that a hook walk ends in a particular box. Now we will be looking at conditional 
probabilities P[ab\xy] that a hook walk ends in a given box (a, b) provided that it 
went through another given box (x,y). 

We want to prove some general facts, preferably explicit formulae, about 
P[ab\xy]. The first step is the following proposition that provides these formulae 
for hook walks that lie within one row or column. 


PROPOSITION 6.14 

Let x > a and lety > b. Then we have 

P[xy\ay\ = JJ 

a<i<x 'Xy 1 

and similarly, 

P[xy\xb} = JI ~ 
b<j<y xJ 


(6.9) 


( 6 . 10 ) 


PROOF If a hook walk passes through (a, b) and goes to (x,y), then its first stop 
after (a, b) can be any of the other h a h -\ boxes of H a b , after which it can proceed 
to (x, y) in many ways. Therefore, by Theorem 6.13, 


P[xy\ab\ = 


foa.l 


p \ x v\ ib ] + p l x v\ a i ] j ■ ( 6 . 11 ) 

^ a<i<x b<j<y 
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If b=y, then the second sum is empty, and we get, after rearrangement, 
(h a , v ~ l)P[xy\ay\ = ^ P[xy\iy\. 

a<i<x 


( 6 . 12 ) 


In order to get rid of the factor h a , y - 1, we apply the following clever trick [116]. 
We take (6.12) for a+ 1, instead of a. Then we get 

(h a+ i, y -l)P[xy\(a+l)y\= ]T P[xy\iy], (613) 

a+l<i<x 

Subtracting (6.13) from (6.12) we get 


(■ K, y ~ ^)P[xy\ay] - (h a+ i tV - l)P[xy\{a + 1 )y] = P[xy\(a + 1 )y], 
P[xy\ay] = -~^f- P[xy\{a + 1 )y\. 

•la,y A 

So we have obtained a formula for P[xy\ay] that is the product of P[xy\(a+ 1 )y] 
and another term, namely Applying the same procedure for P[xy\(a+V)y], 

then P[xy\(a+2)y], and so on, we will be left with the product of fractions, and 
get (6.9). 

The proof of (6.10) is analogous. 


A direct consequence of this Proposition is the following, surprisingly compact 
expression for P[xy\ab]. 


COROLLARY 6.15 

Let x > a and y ' b, then we have 

P[xy\ab]=P[xy\ay]-P[xy \xbj. (6.14) 

PROOF Let Abe the rectangular Ferrers shape with northwestern and southeastern 
corners (1,1) and (x,y), and let S C F be the subset of boxes (a, b) in FTor which 
(6.14) is not proved yet. So, at the beginning, S= F-(x, y). Let (a, b) be an inner 
corner of S. We will now prove that (6.14) must hold for (a, b). 

Let us look at (6.11) again. In the first summation, we have a<z<bc, therefore 
we are below the box (a, b), that is, we are in P S, and so we can apply (6.14) to 
replace the terms P[xy\ib]by the terms P[xy\iy]P[xy\xb]. This leads to 

y, P[xy\ib\ = P[xy\iy]P[xy\xb] = P[xy\xb} P[xy\iy\ (6.15) 

a<i<x a<i<x a<i<x 

= P[xy\xb]P[xy\ay}(h at y - 1) (6.16) 
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FIGURE 6.9 

A shape and its complementary shape. 


where in the last step we simply used (6.12). 

If we transform the second summation in (6.11), we get that 

Y2 p [ x y\ a i] = p [ x y\ a y]P[ x y\ xb \( h x,b - 1 ). (6.17) 

b<j<y 

Finally, we replace the two sums in (6.11) by the expressions we just computed in 
(6.15) and (6.17) to get 


or i a p i x y ay} p [ x y xb ]( h a, y - i + **,& -1 

P[xy\ab\ = - - - y - - 

ri a ,b ~ 1 


P[xy\ay}P[xy\xb\, 


as it is clear that h m +h x b -l=h a b . See Figure 6.7 for an illustration of this. 

Now that we proved our statement for (a, b), we can remove (a, b) from S to 
get S', and choose an inner corner of the new shape F-S'. We can then repeat the 
whole procedure until (6.14) is proved for all boxes of F. 


In order to continue our proof of the Frobenius formula, we need some new 
notions. Let F’be the Ferrers shape of a partition \={a l , a 2 . ..., ak). Recall that 
these parts are written in non-increasing order. Choose p and q so that pi>k and 
q>ai. Then the (p, q)-complementary board Fp q is the set of boxes that are in the 
rectangle spanned by (1, 1) and (p, q), but not in F. 


Example 6.16 

Let F’be the Ferrers shape of (5, 4, 3), and let (p, q)=( 4, 6). Then F'\s shown in 
Figure 6.9, and F vq is represented by the shaded boxes. Q 

Note that the outer comers of F, that is, the boxes that we can add to F to get 
another Ferrers shape, are the same as the inner comers of F£, q , that is, elements 
that can be removed from Fp,q. For the rest of this proof, if an outer corner of 
Fp q is denoted by ft, and the box immediately on its left is an inner corner of F, 
then such a box will be denoted by K. 

We define a special complementary hook walk in the complementary board Fp q 
as a walk that starts at (p, q) (hence “special”), and uses steps north and west 
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(hence “complementary”). These walks can stop at any of the inner corners 
of Fp,q. 

Finally, we define the distance between two boxes in the obvious way, that is 
d((x, y), (v, z))=\x-v\ + \y-z\. 

In what follows, we will be interested in computing the probability that a special 
complementary hook walk stops in a given inner corner of Fp,q- The alert reader 
probably suspects that we are interested in this because our random procedure 
that will produce each SYT with the correct probability will be defined recursively, 
by ensuring that it puts the maximal element in a certain corner. The alert reader 
is right, of course. 

The following Lemma shows that the choice of p and q is not that crucial. 


LEMMA 6.17 

Let\=(a u ^ 2 , a/) be a partition ofn corresponding to t!w Ferrers slmpe F, and letp~>k and 
q>a x . Then the probability that a special complementary hook walk in F£ will end at inner 
comer K of Ff q is 


- = YUdiKjf 
1 J u R d(K,Ry 


(6.18) 


where R ranges over all inner corners ofFf q , and R ranges over all outer comers of F. 


Note that this means in particular that the probability P[K] is indepenelent of the 
choice of p and q. Indeed, the inner corners of Ff q are determined by the outer 
corners of F, and so neither the numerator nor the denominator of the right- 
hand side of (6.18) depends on p and q. 


PROOF First, we will prove our statement for a regular (not a complementary) 
Ferrers shape. So instead of special complementary hook walks, we will be looking 
at hook walks starting at (1, 1). 

Let K=(x,y) be the corner in which we want our hook walk to finish. Then we 
need to compute the probability i^xylll]. Applying (6.14), then (6.9) and (6.10), 
we get 

P[xy\ll\ = P[xy\ly\P[xy\xl\ = ] f Jf+hlL . llx d+i . 

l<i<x t,y 1 <j<y 

Taking a closer look at the fractions on the right-hand side, we see that many of 
them are equal to 1. For instance, look at the generic term The horizontal 

parts of these hooks end in (x,y). The vertical parts are different if and only if the 
/ill column of our shape ends in a corner R. It is in this case, and only in this case, 
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FIGURE 6.10 

A generic term that does not cancel. 


that F 1. See Figure 6.10 for an illustration. In this case, the denominator 

is equal to d(K, R), whereas the numerator is equal to d(K, R), where R is the 
box just below the end of column j+ 1. 

If we change our shape to a complementary shape, our hook walk to a special 
complementary walk, and K to K, we get the statement of the lemma. | 


Example 6.18 

Considering the Ferrers shape in Figure 6.10, let K=( 3, 4), and let us look at the 
term , hz ' 2 , . The value of this term is 3/6. On the other hand, we have 
d(K. R) = 6 and d(K, R) = 3, verifying our argument. Q 

The previous lemma provided a formula for P[K ], but it was a rather complicated 
one. The following lemma will bring that formula closer to what we need. 

LEMMA 6.19 

Let F be a Ferrers shape on m boxes, and let F’be a shape obtained from F by adding an outer 
corner K. Then we have 


P[K} = 


eF 1 


(m + l )/ 1 


PROOF We can express both f F and / 7 ' by the hooklength formula. After that, 
ml cancels, and so do all hooklengths that belong to hooks not being in the same 
row or column as K- 

What happens with the remaining terms? Let K = (a:, y). Looking at the row 
of pg. we see that the hook H r] of F is as long as the hook H' x - +1 of F ', unless the 
vertical parts of these hooks are different, that is, unless there is a corner R at the 
end of column j of F. In that case, instead of a cancellation, we get a 
d{K , R)/d(K, R) factor. An analogous argument applies for the column of K- 
However, by Lemma 6.17, the product of these terms d(K. R) j d(K. R) is precisely 
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FIGURE 6.11 

How corners cause non-cancelling terms. 


P\K], proving our claim. 


Example 6.20 

Let F be the Ferrers shape of the partition (5, 4, 1), and let K = (2,5). The 
reader is invited to verify using Figure 6.11 that the hooks of L and F’ have the 
same length if the peak of the hook is not in the second row and not in the fifth 
column. The reader is also invited to verify cancellations involving h, . of L and 
h x ,j +1 F , provided that column j of 7’docs not end in a corner. 

If 7=1, then column j of Tends in corner R. Then we have h iA =r> in 
F and h'. 2 2 = 4 in F' in F’. On the other hand, we have d(K. R) = 5, and 
d(K, R) = 4, verifying our argument. [] 


Now we are ready to describe the random procedure that will produce each SYT 
of shape F with probability /'/^!, for all shapes Ton n boxes. 

Choose p and q so that they are larger than n. We construct a series of SYT 
T 0 , 7\, —, T„ so that 7”has z boxes as follows. Obviously, T 0 is the empty tableaux. 
For shortness, let %% = the complementary shape of 7) in the rectangle 
spanned by (p, q). For i > 1, we get 7) from 7/ by inserting the entry i to one of 
the outer corners of 7/. The crucial question is which outer comer ? This corner is 
chosen by taking the endpoint of a special complementary walk in Z lA . 


THEOREM 6.21 

Let T be a Standard Young Tableau on n boxes having shape F. Then t!w random procedure 
defined in the previous paragraph produces T with probability f F /n!. 


PROOF We prove our statement by induction on n, the initial case being trivial. 
Let 7 * be the SYT that we obtain from Tby omitting n, and let F* be its shape. 
Then, by our induction hypothesis, the probability that our algorithm constructs 7 ’ 
is P[T i/y/C^wT)!. When T is constructed, first 7*must be constructed (and the 
probability of this is J^tyn-l )!), and then a randomly chosen special complementary 
walk in the complementary board of T* has to end in the corner K = T — T*. 
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The probability of the latter is, by Lemma 6.19, P[K] = f F /n ■ j F ". Therefore, 
we have 



P[T} = P[T*]P[K] 


(n — 1)! n ■ f F * n\ 


completing the proof. 

As we have explained after Example 6.11, Theorem 6.21 immediately implies 
Theorem 6.10. 


6.2 Expectation 

We need a litde more machinery in order to use stronger probabilistic tools. First 
of all, we formalize the common sense notion of probability we used in the 
previous section. 

DEFINITION 6.22 Let O be afinite set ofoutcomes of some sequence of trials, so that all 
ofthese outcomes are equally likely. LetB CO. Then we call id a sample space, and we call 
B an event. The ratio 



is called the probability ofB. 

A random variable is a function X:Ll —»R that associates numbers to the elements 
of our sample space. Most of the random variables we are going to work with 
will have a finite range, that is, the set of values they take will be finite. The sum 
and product of two random variables, and a constant multiple of a variable is 
defined the way that is usual for ordinary functions. As the reader has probably 
noticed, this book is mostly devoted to permutations, therefore our sample space 
11 will most often be S„, and the random variable X will most often be some 
permutation statistic. 

One of the most important statistics of a random variable is its expectation, 
defined below. 

DEFINITION 6.23 Let Tfl^R be a random variable that has a finite range. Then the 
number 


£(X) = P(X = i) 


ies 
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is called the expectation of X on Cl. 

Other names for E(X) include “expected value,” “expected number,” “mean,” 
“average,” or “weighted average.” The latter is meant with the individual 
probabilities as weights. If there is no danger of confusion as to what the variable 
X is, its expectation E(X) is sometimes denoted by ft. 

For certain random variables, the expectation is easy to compute directly. This 
is often the case for variables defined by symmetric permutation statistics. 


Example 6.24 

Recall that d(p) is the number of descents of the permutation p. Let X: S„— >R 
be the random variable defined by X(p)=d(p)+ 1. That is, Xcounts the ascending 
runs of p. Then E(X)=(n+l)/2. Q 


PROOF Recall that A(n, i) denotes the number of w-permutations with z-1 descents. 
Directly from the definition of E(X), we have 




i =1 

Noting that A(n, i)=A(n, n+l-i), we see that 

Yl 1 

iA(n, i) + (n — i + 1 )A(n, n + 1 — i) = ——— • (A(n, i) + A(n , n — i + 1)), 


and the proof follows after summation on i. | 


6.2.0.3 An Application: Finding the maximum element of a sequence. 

The fact that X was symmetric played a crucial role in the above argument. 
Nevertheless, there is another interesting general phenomenon of which the above 
example was a special case. Recall from Chapter 1 that the Eulerian numbers 
A(n, k) are not only symmetric, but also unimodal, therefore the sequence A(n, k) 
has either its one or two maxima in the middle. 

Recall that the Eulerian polynomials A n (x) are defined by the equation 
A n (x) = A(n,k)x k . Therefore, A' n (x) = ]T^ =1 kA(n,k)x k ~ 1 .and 

n 

A' n (l) = Y / kA(n,k)=n\E(X). 

k =1 

In other words, E(X) = Noting that we did not use anything 

specific about the Eulerian polynomials, we conclude that the expectation of a 
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permutation statistic on S„ can be obtained by substituting 1 into the derivative of 
the relevant generating function, and dividing the result by n\. 

Let A(x) = j a t x i be the ordinary generating function of some sequence 
«!, a 2 , —, a m that enumerates w-permutations according to some statistic. The 
expression A\l)/n\ or ^ may ring a bell for the alert reader. Indeed, recalling 
Darroch’s theorem (Theorem 3.25), we remember that if A(x) has real zeros 
only, then its sequence of coefficients has at most two maximal elements, and 
these are at distance less than one from TLL-. So we have proved the following 
corollary of Darroch’s theorem. 

COROLLARY 6.25 

Let s be a permutation statistic on S n , and let di = |p 6 S n : s(p) = i\. Let X be the 
random variable corresponding to s. Assume that the ordinary generating function 
A(x) = XliLi a i x ' has rea l zeros on ly- Then the sequence a x , flj, —, a n has either one or two 
maximal elements, and they are at distance less than one from E(X). 


In other words, the expectation of X does not only help us to understand the 
average behavior of X, but also (if the real zeros condition holds) provides near¬ 
perfect information about the location of the maxima of X. 


6.2.1 Linearity of Expectation 

One of the reasons for which the expectation of a variable is a very useful 
statistic is the following theorem. 

THEOREM 6.26 

Let X and Y be two random variables defined over the same finite sample space 1L. Then 

E(X+Y)=E(X)+E(Y). 

PROOF Let r£(l, then by definition we have X(r)+Y(r)=(X+Y)(r), so 
X(r)P(r)+Y(r)P(r)=(X+Y)(r)P(r). Addingthese equations for all r € f2, we get 

E(X + Y) = £(X + Y )(r)P(r) = £ X(r)P(r) + £ Y(r)P(r) 

refl re o reo 

= E(X)+E(Y). 


I 


Note that we have not used anything about X and Y other than they are 
defined over the same sample space. It did not matter how (if at all) they 
were related to each other. In other words, the mean of the sum of two variables 
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is insensitive to dependency relations, in case you were wondering about the 
chapter title. 

We also point out that if a is a positive constant, then it is easy to prove that 
E(aX)=aE(X). This fact, together with Theorem 6.26 is often referred to by 
saying that it is a linear operator. 

The application of Theorem 6.26 often involves the method of indicator random 
variables as in the following example. 

Example 6.27 

Let n > 2. The expected number of 2-cycles in a random w-permutation is 

1 / 2 . □ 

PROOF Let X:S n ->R be the random variable giving the number of 2-cycles of a 
permutation. Let i and / be two distinct elements of [n], and let R be the 

random variable defined by 

{ 1 if i and j form a 2-cycle in p , 

0 otherwise. 

It is then easy to compute the expectation of Xij. Indeed, 

E{X id ) = 0-P(X u = 0)+l= 1) - P(X U = 1) = ^ 

(6.19) 

The observation that makes the variable X tJ - useful for us is that summing Xij(p) 
over all 2-element subsets (i,j) C [n] we get X(p). Therfore, by Theorem 6.26, 
we obtain 

e(x) = e ( y, X >A = E = b 

\(iJJEW / («)CM V 7 

as the choice of i and / is clearly insignificant. 

The variables X tJ , or in general, variables taking values 0 and 1 depending on the 
occurrence of an event, are called indicator random variables, and are often very 
useful. 

The following theorem is a not very subde, but very general, tool in proving 
that it is unlikely that a variable is larger than a multiple of its expectation. (We 
will state the theorem in a special case that is relevant in our combinatorial 
applications, but a more general treatment is possible.) 

THEOREM 6.28 

[Markov s Inequality] Let X be any nonnegative random variable that has a finite range, and 
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let a>0. Then we have 


P[X > a/i] < —, 
a 

In the next section we are going to prove a very similar inequality (Theorem 
6.34), which is often called Chebysev’s inequality. The ambitious reader is invited 
to wait for the proof of that inequality, then to try to prove Markov’s inequality 
in a similar manner. Then the reader can check the proof that we will provide as 
the solution of Exercise 13. 

Markov’s inequality is particularly useful when the expectation of a variable is 
very low compared to its maximum value. 


Example 6.29 

Let a> 1. Then the number of w-permutations with more than a\i\{n+i) cycles is 
less than nila. 


PROOF This is immediate if we recall that by Lemma 3.26, the average n- 
permutation has H(n) = ]L” =1 \ < ln(n + 1) cycles. 

Note that this example is particularly striking when n is a very large number. Let 
n be large enough so that n is roughly equal to 10000 ln(w+l). Let a=100. Then 
the probability that a randomly selected w-permutation has more than 100 ln(w+l), 
or in other words, roughly ?z/100 cycles is less then 1/100. Speaking in more 
general terms, it is very unlikely that a randomly selected w-permutation will 
have at least cn cycles, regardless of how small the positive constant c is. 


6.3 Variance and Standard Deviation 

While the expectation of a random variable X contains information about the 
average behavior of X, it does not describe how much the different values of X 
can differ. A constant variable can have the same expectation as one that does 
not take the same value twice. If we want to obtain information about the behavior 
of X from this more subtle viewpoint, we must use more subtle operators, such 
as variance. 


DEFINITION 6.30 Let X be a random variable. Tlwn 


is called the variance ofX. 


Var(X)=E((X-E(X)) 2 ) 
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Example 6.31 

Let Xhc defined as in Example 6.27, and let n>4. Then we have 

V arm = T. 


D 


PROOF The following observation that is immediate from the linearity of 
expectation is often useful in variance computations. 

Var(X)=E(X 2 )-2E(X)E(X)+E(X) 2 =E(X 2 )-E(X) 2 . (6.20) 

In our case, Example 6.27 shows that E(X) 2 =1I4:. So all we have to do is to 
compute E(X). 

In order to accomplish that, define the indicator random variables X tJ - as in 
Example 6.27. Note that Xfj = Xij, and therefore, E(Xf^) = E(Xij). Then 
we have 


E(X 2 ) = E 


(/ \ 2 \ 

E ) = E E ( x x 

y (i,i)C[n] 

EiXtjXrj), 


y \(»d)c[«3 
= E(X) + ^ 


E E ( x ij x i'j 
{•'•il/h'd'l 


{i.i} n£i',i'>=0 


where the second term is explained by the fact that no entry of p can be part of 
more than one cycle. 

Finally, we have to compute E(X,jX i 'j) in the case when (i, j) and (i’,j) are 
disjoint sets. In this case, we have P(X ij X i ’ J = l)=(nX)\ln\, therefore 
E(Xi t jXi‘ ji) = ^ n n ^' ■ On the other hand, the number of (ordered) disjoint 
pairs of 2-element subsets of [n] is ( 2 ) (”2 2 ) Substituting these into the last 
equation, we get 


E(X 2 ) = \ + 


(■n — 4)! /rt\ 


n ■ 


3 

4' 


Therefore, 

Var{X) = E(X*) - E{Xf = \- Y -= 1 -. 

I 


Note that if 2<w<4, then fer(i¥)=l/4, since in that case, p cannot have two 
disjoint 2-cycles, and the term Q) 2 ) equals 0, not 1/4. Finally, for n= 1, 
we obviously have E(X)=Var(X)= 0. 
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Example 6.32 

Let Y(p) be the number of descents of the npermataEon p=pip 2 . ■ -p n , and let 2. 

Then we have 


□ 


Var(Y) = 


n + 1 

12 ' 


PROOF Define the indicator variable Y { :S n — »R by 


Yi(p) = 


1 if Pi > Pi+i , 


0 if not. 


It is then clear that Y% = Y, that E(Y)= 1/2, and that E(Y 2 ) = 1/2. It is 
immediate from Example 6.24 that E(Y)=(n-l)l2. Therefore, we have 

Var{Y) = E(Y 2 ) - E(Y) 2 = E(Y 2 ) - ^ n ~ ^ 

and 

E{Y 2 ) = E [ Y^\ ] = J2 E{ y?) + E 2E ^ Y i) 

y \i= i / J i=l i<j 

= ^ + E 2E ^ Y i) + E 2 tot+i)- 

i<j—l i =1 


In the [ n 2 2 ) pairs where i</-1, the probability that i and j arc both descents is 
1/4. In the pairs where j=i+ 1, this probability is 1/6 as only one of the six 
possible patterns on the entries p„p,+i, P ,+2 has the required property. Substituting 
this into the last equation, we get 


Therefore 


E(Y 2 ) 


n — 1 i (n — 2)(n — 3) | n — 2 
2 + 4 + 3 


V ar(Y) = ~ + 


(rt — 2)(n — 3) i n —2 
4 + 3 


(»~ 1) 2 

4 


n + 1 
12 


(6 . 21 ) 


I 


The variance of X is always nonnegative as it is the expectation of a non¬ 
negative variable. This makes the following definition meaningful. 


DEFINITION 6.33 The standard deviation of the random variable Y is the value 
y/Var(X). 
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If there is no danger of confusion as to which variable is referred to, the 
standard deviation is often denoted by cr. 

The following classic theorem shows that it is unlikely for a variable to differ 
from its expectation by a multiple of its standard deviation. 

THEOREM 6.34 

[Chebysev’s Inequality] Let A>0, and let X be any random variable that has afinite range. 
Then we have 


P[\X-iA > Act] < 

PROOF Note that 

E((X - /i) 2 ) = P[(X -fi) 2 =i}> Y i- P[(X - fi) 2 = t] 

i i> A 2 ff 2 


as the last term is obtained by omitting some nonnegative summands from 
the previous one. Since each summand in the last term satisfies i > A 2 ct 2 , this 
implies 

ct 2 > A 2 ct 2 • V P[(X - fi) 2 = i\ = A 2 ct 2 • P[\X - n\> Act]. 

i>A 2 <r 2 


Dividing by \ 2 a 2 , we get the inequality that was to be proved. | 

Chebysev’s inequality is useful when the standard deviation of a variable is small 
compared to its expectation, or compared to its maximal value. 


Example 6.35 

The number of permutations of length n with less than C_i — fin + 1 descents is 
at most d/24. [] 


PROOF Let p be a randomly selected w-permutation, and let X(p)= d(p). Then 
by symmetry, 

P[X < _ fi^fifii] = l P [\ X - M | > Vn + 1 ]. 

Therefore, applying Chebysev’s inequality with A = \/l2, we have 


I 


P[x<Y^~ V^+T] < \ 


l 

12 


1 

24' 
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6.4 An Application: Longest Increasing Subsequences 

Let p be a randomly selected n-permutation, and let Xjp), or, when there is no 
danger of confusion, X(p) denote the length of the longest increasing subsequence of p. 
What can we say about E(X)? 

We start with a classic result due to Erdos and Szekeres that will provide us 
with a lower bound for E(X). 


PROPOSITION 6.36 

Let n=km+l. Then any n-permutation p contains either an increasing subsequence of length 
k+1, or a decreasing subsequence of length m+ 1. 


PROOF Extending the idea of Theorem 4.10, we define the 2-way rank of an 
entry of a permutation. The 2-way rank of the entry p; of p is (r, s) if the longest 
increasing subsequence of p ending at p, is of length r, and the longest decreasing 
subsequence of p ending at p, is of length s. For example, the 2-way rank of the 
sixth entry of 7215436 is (2, 4). 

It is clear that no two entries of p can have the same 2-way rank as the 
rightmost of two entries would have higher first coordinate or higher second 
coordinate than the 2-way rank of the leftmost one. Therefore, all n=km+l entries 
of p must have different 2-way ranks. Since there are only km distinct 2-way ranks 
with r G [k] and s G [to], the proposition is proved by the Pigeonhole 
Principle. 

COROLLARY 6.37 

For all positive integers n, we have 


E(X) 

where the expectation is taken over all n-permutations. 


PROOF For any n-permutation p, either p or its reverse p ' has an increasing 
subsequence of length at least fri. by Proposition 6.36, so X(p)+X (p r ) > fn. 

I 


A little algebraic manipulation can lead to a better lower bound. We know that 
for any n-permutation p, 


maxr(pi) • max s(pj) > n, 

i€n j£n 
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otherwise we could not have n different 2-way ranks in p. For any positive real 
numbers a and b, we have a + b > 2\fab, and therefore 

max r(pi) + maxs(p,) > 2 \fn, 

jEn 

proving (after taking reverses), the following stronger corollary. 

COROLLARY 6.38 

For all positive integers n, we have 

E{X) > Vn, 

where the expectation is taken over all n-permutations. 


The problem of determining E(X) more precisely has been the subject of vigorous 
research in the last sixty years. The above results should at least provide an 
intuitive justification as to why the following question, also known as Ulam’s 
problem, was asked in this form. 


QUESTION 6.39 Let n go to infinity. Does the limit 


lim 

n-b-oo 


EjXn) 

\/n 


( 6 . 22 ) 


exist, and if yes, what is it? 


Corollary 6.38 shows that if (6.22) exists, then it is at least 1. The existence of 
(6.22) was first proved by Hammersley in [123]. Numerical evidence then seemed 
to suggest that this limit is probably close to 2. This result has originally been 
proved in two parts, [145] showing lim n _>oo > 2, and [193] showing 

lim ^ E i x « l < 2 
nm n _Kx> y n 

Knowing the expectation of X n only begins the story. The solution of Ulam’s 
problem opened the door to even deeper research of the distribution of X„. The 
interested reader should consult [4] for a survey of results, and [17] for the latest 
spectacular improvements in this area. 


Exercises 

1. Let i, j, k and / be four elements of [n], and let p be a randomly selected 
w-permutation, where w>4. 
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(a) What is the probability that the four elements are all in one cycle of p? 

(b) What is the probability that these four elements are in four different 
cycles? 

2. What is the probability that the three largest entries of an w-permutation are 
in the same cycle? 

3. Continuing Exercise 1, what is the probability that the four elements are 

(a) in two different cycles? 

(b) in three different cycles? 

4. Find the expectation and variance of the number of excedances of randomly 
selected w-permutations. 

5. Let n>k. Find the expected number of Ccyclcs in a randomly selected n- 
permutation. 

6. Find the variance of the number of Ccyclcs of randomly selected n- 
permutations. 

7. Let Z(p) be the number of inversions of the w-permutation p, and let n> 2. 
Compute the variance of Z 

8. Let U(p) be the number of weak excedances of the w-permutation p. Compute 
E(U) and Var(U). 

9. Is there a Ferrers shape on 20 boxes that has three hooks of length 7? 

10. Let Xhc the number of leaves of a randomly selected rooted plane tree on 
n+ 1 vertices. Find E(X). 

11. We want to build a tree with vertex set [n] satisfying as many constraints as 
possible from a finite set of constraints. The constraints are all of the type 
{(a, b)(c, d)}, meaning that the unique path in the tree connecting a to b 
should not intersect the unique path connecting c to d. 

Prove that no matter how many constraints we have, we can always find a 
tree having vertex set [n] that satisfies at least one third of them. 

12. What is the average number of alternating runs of all w-permutations? 

13. Prove Markov’s inequality. 

14. Let Xbe defined on the set A of all 1234-containing n-permutations, and let 
X(p) denote the number of 1234-copies in p. Let Y be defined on the set B of 
all 1324-containing w-permutations, and let Y(p) denote the number of 1324- 
copies of p. What is larger, E(X) or E(Y)? 

15. Let Y(p) denote the length of the cycle containing the entry 1 in a randomly 
selected involution of length n. Find E(Y). 
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16. Find a non-inductive proof for the fact that the average number of cycles of 
a randomly selected zz-permutation is H(n) = ^”=1 j- 

17. Let abc a positive real number, and let p be a randomly selected zz-permutation. 
Prove that 


P 




->• 0 


as n goes to infinity. 

18. Let X n be defined on the set of derangements of length n, and let X n (p) be the 
number of cycles of p. Prove that for zz>4, we have 


{n-l)D{n-2)E{X n .^l) + {n -l) D(n-l) E{X n . 1 )=D (n)E (X n ). 


19. Let T„ be defined on the set of derangements of length n, and let T n (p) be the 
size of the cycle containing the maximal entry of p. Find a recursive formula 
for E(Y r ). 

20. We know from Section 1.2 that for any fixed n, the sequence G(n, k) is 
unimodal. Where is the peak of that sequence, that is, for which k is G(n, k) 
maximal? 


21. Let p and q be two randomly selected zz-permutations, and let G pyq be the 
graph defined in Problem Plus 5. Let Pfp, q) be the number of Hamiltonian 
cycles in G p>q . Prove that 


E(Z) = 


n + 1 
n 


22. Let Xhc a random variable whose range is finite, and assume that X takes 
only nonnegative values. Prove that 


P[X = 0] < 


Var(X) 
E(X ) 2 ' 


23. Find an alternative proof for the result of Example 6.27 using the result of 
Example 6.2. 

24. We extend the notion of descents to Standard Young Tableaux as follows. We 
say that i is a descent of the SYT of P if z'+l appears in a row of P that is 
strictly below the row containing z. So for instance, the SYT consisting of one 
single row has no descents, while the SYT on n boxes consisting of one single 
column has n-1 descents. 

Let n be a fixed positive integer. For i £ [n — 1], let Afc the event that i is a 
descent in a randomly selected SYT on n boxes. Find a short, direct proof of 


2004 by Chapman & Hall/CRC 






Mean and Insensitive. Random Permutations. 


241 


the fact that P[Aj is independent of i. Do not use the result of the next 
exercise. 

25. Let A be a Ferrers shape on n boxes. Let % g \n - 1], and let A hk be the event 
that i is a descent in a randomly selected SYT of shape A. Prove that Pr[A ik ] 
is independent of the choice of i. 

26. A deck of cards is bijectively labeled by the elements of [n], and is originally 
arranged in increasing order of the labels. We split the deck into a smaller 
decks of consecutive cards, with empty decks allowed, then we randomly 
merge the small decks together. Note that before merging, the smaller decks 
contained their cards in increasing order. This sequence of operations is 
called a riffle shuffle. Prove that the probability that we obtain a given permutation 
p e Sn is 

/a+n—ri(p)\ 

p f„l = A- H - L 

where ri(p) denotes the number of rising sequences of p. A rising sequence is an 
increasing subsequence of consecutive integers. For instance, 1324756 has three 
rising sequences, namely 12, 3456, and 7. 

27. Deduce Theorem 1.8 from the result of the previous exercise. 

28. +AsetNof w-permutations is called min-wise independent if for all X C [n], and 
all x G X we have 


P[min{p(X)} = p x } = 

where p=p]jh---p„ is a randomly selected permutation in F. That is, each 
element of X is equally likely to be the index of the smallest entry among 
the entries indexed by X. 

Small families of min-wise permutations are important tools in efficiently 
detecting identical or near-identical websites. Prove that there exists a min- 
wise independent family of size less than 4”. 

29. Let Nbe a min-wise independent family of w-permutations. Find a lower 
bound for |F|. 

30. Let Nbe a min-wise independent family of w-permutations, and let i G [n]be 
a fixed integer. Let X C [n], and let a; c x. Denote P x the probability that 
for a randomly chosen permutation p = P 1 P 2 • • • p n G F. the entry p, is the 
zth smallest among the entries in the set p(X). Prove that 
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Note that the definition of min-wise independence automatically assures this 
in the special case when i= 1. 


Problems Plus 

1. Let i G. [n — 2], and let A be a given Ferrers shape on n boxes. Let X, be the 
indicator variable of the event that both i and z+1 are descents in a randomly 
selected SYT of shape A. Is it true that E(X) is independent of i? (Descents 
of SYT are defined in Exercise 24.) 

2. Let X, be defined as in the previous Problem Plus. Find a formula for 

m- 

3. Let l<y<z-l<zz-2, and let X hi be the indicator variable of the event that both 
i and j are descents in a randomly selected SYT of shape A. Is it true that 
E(X;J is independent of z? 

4. Let Xij be defined as in the previous Problem Plus. Find a formula for 

mj- 

5. Let p=pYpr"pn be an zz-permutation, and let Gp be the directed graph on 
vertex set [n] in which there is an edge from i to j if p=j. 

Now \et p and q be two randomly selected zz-permutations, and consider the 
graph G /v/ on vertex set [n] whose edges are the edges of G p and the edges of 
G r Let zz go to infinity. What is the probability that G p.i contains a directed 
Hamiltonian cycle? (A Hamiltonian cycle of a graph is a cycle that contains 
all vertices of the graph.) 

6. Keep the notations of the previous Problem Plus, and let p, q, and r be 
three randomly selected zz-permutations. Let G p.i. r be the graph that consists 
of the edges of Gp, G q, and G r on vertex set [n]. Let zz go to infinity. What is 
the probability that G P,i, r contains a directed Hamiltonian cycle? 

7. What is the expected number of descents in a randomly selected SYT 

of a given shape A on zz boxes? 

8. We have defined descents in Standard Young Tableaux Exercise 24. 
Continuing that line of thinking, we can extend the notion of major indices to 
SYT by setting maj( r l ) = f2i£D(T) *. Here D(T) denotes the set of all 
descents of the Standard Young Tableaux 1. 

What is the expectation F ma j ,a of the major index in a randomly selected 
SYT of a given shape A on zz boxes? 
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9. Let p be a permutation obtained by the riffle shuffle algorithm given in 
Exercise 26. Find a formula for the probability that p is of a given type. 

10. Let a=2, and let us repeat the riffle shuffle algorithm of Exercise 26 m 
times. This means that starting from the second application, the smaller 
decks will not necessarily contain their cards in increasing order. Let p be a 
fixed w-permutation. Find a formula for the probability that this procedure 
results in p. 


Solutions to Problems Plus 

1. Yes, this is true, and can be proved similarly to Exercise 25. See [124] for an 
analysis of all cases. 

2. Let A=(« b ^ 2 , ..., aj, and let the conjugate of A be denoted by A'= («j, 

■■,«»)• It is proved in [124] that 


E{Xi) = d x , 


E 

h>j>k 


a'i(a'j ~ !)K ~ 2 ) 

n(n — 1 )(n — 2) 


(6.23) 


3. Yes, this is true, and can be proved again similarly to Exercise 25. See [124] 
for details. 

4. Let us keep the notations of the solution of Problem Plus 2. Furthermore, 
let us introduce the new notation 


eA 


E 

h>j>k>l 


0,% (Oj 

n(n 


l)(»jc — 2)(q; — 3) 
l)(n - 2 )(n - 3) 


It is then proved in [124] that 

E(X iiJ ) = c y -d k -dy+e x +e y , 
where c A -is defined in (6.24) below. 

5. This probability converges to zero as n goes to infinity as is proved in [63]. 

6. This probability converges to 1 as n goes to infinity as is shown in [95]. 

7. Let the given shape A have rows of length a x , a 2 ,..., a k and columns of 
length aj, a' 2 , * - ■, a].- It then follows from a more general result of Peter 
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Hasto [124] that 


c\> = Ed,x — 


n — 1 


i+E 


* aMi - 1) A. a'(a' - 1) 


E 


fri n ( n - 1 ) n (n - 1 ) 

a't ( a 'j ~ 1 ) 


DE 


j<t 


i(n — 1) 


(6.24) 


(6.25) 


Note that by the result of Exercise 25 and the linearity of expectation, we get 
that for a given i £ [n — 1], 


P[i e D(T)} = Y, 


j<t 


a'M'j ~ 1 ) 
n(n — 1) 


(6.26) 


where 74s a randomly selected SYT of shape A. 

8. It is straightforward from (6.26) and the linearity of expectation that 


E 


maj, X 


( n \~ i, V' a »(«t - 1) _ y* djia'i - 1 ) 

\ 2 / 2 n(n - 1) pi n(n - 1) 

( n \ a t( a j ~ 

W f? t n(n- 1) * 


where the are defined as in the solution of the previous Problem Plus. 
Note that here, just as in (6.24), the two sums within the brackets cancel if 
our Ferrers shape is self-conjugate. 

9. It is proved in [76] that the probability that p is of type (w 1; n^, •••, w„) is 

n;„ t'-tr 1 ) 

a” 

where f„ is the number of aperiodic circular words of length j over an alphabet 
of a letters. In other words, /)„ is the number of ways to design aperiodic 
necklaces using ; beads having colors 1, 2, ..., a. The numbers f a are fairly 
well-studied, and it is known that 

f ja = lY^' d , 

] d\j 

where d ranges all positive divisors of j, and pi is the number-theoretical 
Mobius function, that is, pi(d )=0 if d is divisible by a perfect square larger 
than 1, otherwise pi(d)=(-l) t , where k is the number of distinct prime divisors 
of d. See [179] for this fact, and the definition of the Mobius function in a 
more general setting. 
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•2 m +n-ri 
. _ n 

^mn 



See Exercise 26 for the definition of ri(p). 
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7.1 The Robinson-Schensted-Knuth correspondence 

This chapter is devoted to the connections between permutations and various 
other objects in combinatorics. This is certainly a huge topic, and we can therefore 
only skim the surface of a few selected areas. Our goal is to give the reader an 
overview of some main lines of research to aid the decision of what literature to 
consult next. 

In the first section, we present the famous Robinson-Schensted-Knuth 
correspondence that connects the combinatorics of permutations and the 
combinatorics of Standard Young Tableaux. There are several excellent books 
[167], [100] devoted entirely or mostly to the fascinating subject of Young 
Tableaux, and we will not try to parallel them. Instead, we will be concentrating 
on those parts of the area that are most direcdy connected to the enumeration of 
permutations. 

The Robinson-Schensted-Knuth correspondence provides a direct bijective 
proof for Theorem 6.10, showing once again that the number of pairs of SYT of 
the same shape, consisting of n boxes each, is n\. This is achieved by a bijection 
risk from the set of all w-permutations onto that of such pairs. (There is no risk 
involved, but when pronounced, that word is shorter than, say, RSK.) In 
addition, the bijection has a very rich collection of interesting properties, such as 
turning natural parameters of permutations into natural parameters of SYT. 

To start, let TV=Ttin t —n n he an n-permutation. We are going to construct a pair of 
Standard Young Tableaux risk(n)=(P, QJ_. The two tableaux will have the same 
shape. The two tableaux will be constructed together, in n steps, but according to 
different rules. In the i J -tableau, some entries will move after they are placed, 
while this will not happen in the (^tableau. We will call our tableaux P and 
throughout the procedure, but if we want to emphasize that they are not 
completely built yet, we call them P, and to show that only i steps of their 
construction have been carried out. 

We are going to describe the bijection risk step by step, and we will illustrate 
each step by the example of ^=52314. 
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P Q 


FIGURE 7.1 

The situation after Step 1 of creating risk (52314). 

P Q 

314 


FIGURE 7.2 

The situation after Step 2 of creating risk (52314). 


Step 1 First, take the first entry of n, and put it in the top left corner of the 
tableau P that we are in the process of creating. Then put the entry 1 
to the top left corner of the tableau Q_that we are creating, encoding 
the fact that the top left corner was the first position of P that was 
filled. 

Step 2 Now take k 2 , the second entry of n. If nf>n^ then simply put n 2 to the 
second position of the first line of P. We then write 2 in the second box 
of the first line of (^to encode that this was the box of P that got filled 
second. 

If (as in our running example), then we cannot do this as the first 
row of P cannot contain entries in decreasing order. Therefore, in this 
case, tt 2 will take the place of n u and % will descend one line, to take 
the first position of the second row of P. To encode this, we write 2 to 
the first box of the second line of 

Step i We then continue the process the same way. Assume that the first i 
entries of n have already be placed, and that we created a pair (P h QJ of 
partial SYT on i boxes each, and of the same shape. Now we have to 
place 7T, +1 . Generalizing the rules that we have seen in the special case of 
i= 2, we look for the leftmost entry y in the first row that is larger than 
7i M . If there is no such entry, we simply put n M to the end of the first 
line. If there is such an entry y, then n t+l will take the position of the 
entry y, while y will descend one line, and look for a position for itself in 
the second row of P,. In other words, 7r I+1 displaces the smallest entry y 
of the first row that is larger than 7T I+1 . When y is looking for its place in 
the second row, the same rules apply to y as applied for 7t i+1 in the first 
row. That is, if there is no element larger than y in the second row, then 
y will be placed at the end of the second row, otherwise y will displace 
the smallest entry z in the second row that is larger than y, and this entry 
z will descend to the third row, to look for a position, subject to the same 


2 


2 _ 
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FIGURE 7.3 

The situation after Steps 3, 4, and 5 of creating risk (52314). 


rules. When this procedure ends, the resulting tableau P M will have a 
box in a position where P, did not. We will write z'+l into that position 
in the (^tableaux, to encode the fact that that position was the (z+l)st 
position of P to get filled. 

Repeating this placement procedure n times, we obviously get a pair (P, Qf= (Pn, 
Qn) of Standard Young Tableaux of the same shape, consisting of n boxes each. 
Indeed, the shapes of P and Qare identical as in each step we added a new box 
to the same position in each of them. We then set risk(n)=(P, Ql- 

THEOREM 7.1 

The map risk defined, above is a bijection from S„ to t!w set ofipairs of Standard Young 
Tableaux (P, Q) having identical shape and consisting of n boxes each. 

PROOF It suffices to show that risk has an inverse, that is, for any pair (P, QJ_ of 
SYT having identical shapes and consisting of n boxes each, there exists exacdy 
one 7t 6 S n so that risk(n)=(P, Qf 

We prove our statement by induction on n, the initial case of n= 1 being trivial. 
In order to prove the inductive step, we show how to recover the last entry n„ 
of k from (P, Qf The position of the entry n in the {^tableau reveals which 
position a of the /Tableau was filled last. As n is at the end of a row in (fi this 
position a is at the end of a row in P. If this is the first row, then there was no 
displacement involved in the last step of the creation of Pand Q^and iz„ is simply 
the content c(a) of position a. 

If a is at the end of row i, then c(a) got to a after being displaced from its 
position b in row z-1. Fortunately, we can easily recover b. Indeed, if u was 
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the entry that displaced c(a) from position b , then in the (z-l)st row, c(a) was 
the smallest entry larger than u. Therefore, u is the largest entry in the (z- 
l)st row that is smaller than c(a), and the position of u is b. Now we can row 
of P, then u could not have come from a higher row, so u must have come 
argue as in the previous paragraph. That is, if z=2, that is, b is in the first 
directly from n, forcing n=u. Otherwise, u got to b after being displaced 
from the (z-2)nd row. In the latter case, we repeat the above argument to 
find the position in the (z- 2 )nd row from which u was displaced, and the 
entry that displaced it, and so on. 

This procedure ends when we reach the first row and find out which 
entry started Step n of the tableau-creating procedure. That entry is, 
obviously, k. 

Once we have determined n m we remove the box containing n n from Pand the 
box containing n from (f By the definition of the (^tableaux, this leaves us with 
two SYT on n-l boxes that are of identical shape. (The entries of P’ are not 
necessarily the elements of [n-l], but that does not matter as they are precisely 
the entries of the partial permutation n’ By our induction 

hypothesis, we can recover n’ from the pair (P\ Qj, completing our induction 
proof. 

Example 7.2 

Let /bind Qbc as shown in Figure 7.4. Then the position of 8 in Qtells us that the 
last box to be added to Pwas the box containing 7. This is the box a of the above 
argument. Its content c(a)=7 got there after being displaced from the end of row 
2. In that row, it had to be at the end, so it was displaced by the entry 4. The 
entry 4 in turn had to be displaced from row 1, by the largest entry there smaller 
than 4. That entry is 3, so we have tt 8 =3. Q 

If risk(n)=(P, Qf we will often write P(n) and Qfn) for the two tableaux of 
risk(n). 

As we have mentioned, many parameters of n are encoded in risk(n). The 
length of the longest increasing subsequence in n is, for example, very easy to 
obtain. 

LEMMA 7.3 

If the length of t!w longest increasing subsequence ofn is k, then P(n) luis k columns. 

PROOF Recall from the proof of Theorem 4.10 that n, is called an entry of rank 
j if 7i t is at the end of a subsequence of n that is of length j, but there is no 
increasing subsequence of length y +1 in n that ends in n,. 

We prove a stronger statement by showing that the first position each entry of 
rank j of n takes during the construction of P(n) by the risk algorithm is the 7 th 
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FIGURE 7.4 

Recovering (P’ t (f) from (P, (f. 


position of the first row. This will show that the length of the first row, therefore, 
the number of columns, of P(n) is indeed the maximal rank in n. 

Our proof of this stronger statement is by induction on j. If j= 1, then we are 
looking at entries that are left-to-right minima. These entries must enter the first 
row at the first box as they are smaller than any entry previously placed. So the 
initial step is complete. 

Now assume we know the statement is true for entries of rank j, and let us 
prove it for entries of rank j+ 1 . Let x be such an entry. Let x’be the entry of 
rank j on the left of x that is closest to x. By our induction hypothesis, x’ 
entered the first row at box j. So when x starts looking for its position in the 
first row, box /is either taken by Cor (at this point we cannot yet exclude it) 
another, smaller entry that displaced x’. In either case, box j contains an 
entry that is smaller than x, so x cannot enter the first row anywhere before 
position j+ 1 . 

On the other hand, x cannot enter the first row anywhere after position j +1 
either. Indeed, assume this happens; this implies that when x starts looking for a 
position, there is an entry jy<x in position j +1 of the first row. By our induction 
hypothesis, that entry cannot have rank less than j+ 1. That is a contradiction, 
however, for we could affix x to the end of any increasing subsequence ending in 
y, yielding that the rank of x is at least j+ 2 . | 

The result of Lemma 7.3 provides an obvious alternative proof of the fact that 
S„(123"T)<(T1) 2 ”. There is, however, a much more refined application of this 
result to permutation enumeration, one that we promised in Chapter 5. 
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THEOREM 7.4 

For anyfixed positive integer k, the sequence S n (\2--k) is P-recursive in n. 

Note that this is the only result known that proves Conjecture 5.4 for an infinite 
number of patterns. 


PROOF We have just seen in Lemma 7.3 that permutations not having increasing 
subsequences of length k can be associated with pairs of Standard Young Tableaux 
having at most k-\ columns. In other words, S n (V2—k)= ffp f\ where Trans 
through all Ferrers shapes of size n which have at most k-1 columns, and f 
denotes the number of Standard Young Tableaux of shape F. Let F=(m u m,, m k - 
1) be such a Ferrers shape. Then m{>mp>---m k -1>0, and m* — n with m, 

denoting the size of the zth column. The hooklength formula now implies (see 
Exercise 23 for details) that 


fr 


(nii - rrij + j - i) 


0 mi + m 2 H-h m k -i)\ 

(mi + k - 2)!- ■ • (m k -i)l' Al) 


We can easily see that the right hand side is T-recursive in each of its variables, 
therefore, by repeated applications of Lemma 5.14, it is also ibecursive in the 
sum of these variables. This implies that 

fF = S n (l2--k) (7.2) 


is /'’-recursive in n. 

This proof used the connection, established in Lemma 7.3, between the length of 
the first row of P(k) and the length of the longest increasing subsequence of n. 
There is a far-reaching generalization of this observation. 

THEOREM 7.5 

[113] Let n be a permutation, let P(n) have k rows, and let a, denote the length of the ith row 
ofP(n). Thenfor all ie[k], the maximum size of the union ofi increasing subsequences in n is 
equal to aprap -ha,. 


Example 7.6 

Let tt='26 1735984. Then P(n) is shown in Figure 7.5. 

We see from P(n) that (a l5 a%, ag)=(4, 4, 1). On the other hand, the longest 
increasing sequence of 7ris of length four (2678), the largest union of two increasing 
sequences is of size eight (2678, 1359), and the largest union of three increasing 
sequences is of size nine. [ 
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FIGURE 7.5 

The tableau P (261735984). 
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FIGURE 7.6 

The P-tableaux of 31524 and 42513. 
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Note that, in particular, Theorem 7.5 implies that if n and n’ are two n- 
permutations so that P(n)=P(n), then the maximum size of the union of i 
increasing subsequences in n is equal to the same parameter in p ’. In other words, 
this parameter depends only on the /Tableau of a permutation. 

PROOF See [113]. The proof can also be found in [167]. | 

It seems that the increasing subsequences of n are quite well encoded by P(n). 
It is natural to ask whether there are similarly strong results about the decreasing 
subsequences of n. Fortunately, the answer is in the affirmative, because of the 
following theorem of Schensted [168] describing the /-’-tableau of the reverse n' 
of n. 

THEOREM 77 

For any n-permutation n, we have P(n) r =P(it). 

Example 7.8 

Let tz= 3 1524, then tt’= 42513, and the corresponding /’-tableaux are indeed 
conjugates of each other as shown in Figure 7.6. [] 

We need to introduce some machinery developed in [168] before we can prove 
Theorem 7.7. Let Pbe a partial SYT, that is, a tableau that we have at some point 
as our P-tableau during the construction of risk(n). That is, Phas rows and columns 
that are strictly increasing, but the integers written in the boxes of P can form 
any subset of [n], not just an initial segment. 

Now let us say the next entry of n to be inserted to Pby the risk algorithm is 
x. The partial tableau that we obtain from Ponce x is inserted will be denoted by 
r x (P), where r refers to row insertion. Before you ask what other insertion could we 
possibly talk about, we define column insertion just as we did (row) insertion, 
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f(P) 
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2 c(P) 
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FIGURE 7.7 

Row and column insertion of 2 into P. 


except that instead of rows, we use columns. That is, the entry x to be inserted 
arrives into the first column, displaces the smallest entry a that is larger than x, this 
entry a then enters the second column, and proceeds analogously. The partial 
tableau obtained from P by column inserting x will be denoted by cJP). 

Example 7.9 

Let 71=3512746, and let P be the partial tableau obtained after three steps of 
the risk algorithm. Then x=2, and the tableaux P, r 2 (P) and (^(P) are shown in 
Figure 7.7. £] 

The crucial property of row and column insertion is that they commute in the 
following strong sense. 

PROPOSITION 7.10 

Let P be a partial tableaux, and let u and v be two distinct positive integers that are not 
contained in P. Then we have 

c v (r u (P))=r u (c v (P)). 


The proof is not overly difficult, but is a little bit cumbersome as there are 
several cases to consider. The cases are based on what v and v are, and where 
they will be inserted. We do not want to break our line of thought here, and will 
therefore give the proof in Exercise 19. 

Now we are ready to prove Theorem 7.7. 

PROOF Let 7r=w 1 M 2 - "Ww then T(=u n u ltl ---u l . To alleviate notation, let r Ui = Tj, 
and let c Uj — c h and let us omit extraneous parentheses. We claim that 

Cn.Cn -1 • • - Ci(0) = r x r 2 • • -r„(0). (7.3) 

Proving (7.3) is sufficient as the left-hand side is clearly P(tt) t , and the righthand 
side is P(if). We prove (7.3) by induction on n, the initial case of n= 1 being 
obvious. As our initial tableau is empty, it does not make any difference whether 
we row or column insert an entry into it. Therefore, 

rir 2 ---r n (0) = n ■ •-r n _ic„(0) = c„r x • • • r n _ x (0), 
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P(312) Q(312) 
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P(231) 
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FIGURE 7.8 

The images of 312 and its inverse, 231. 


where the last equality holds because of Proposition 7.10. Applying the 
induction hypothesis by replacing n ■ ■ ■ r n _i (0) by c n ~i ■ • ~Ci (0), we get (7.3). 


The following theorem is perhaps even more interesting, and has a plethora of 
applications. It tells us what happens if we interchange the two tableaux that make 
up risk(n). 

THEOREM 7.11 

Letic g Sn> and risk(n)=(P, Q). Then we have risk(7T 1 )=(Q i/ P). 

This classic theorem is due to Marcel-Paul Schiitzenberger [169]. His original 
proof used induction. An elegant geometric proof that in fact proved a more 
general statement was given by X.Viennot in [194]. That proof can be found in 
English in [167]. 

Example 7.12 

Let 71=312, then 7T 1= 231, and as it is easy to verify, risk(n) and riskin' 1 ) are shown 
in Figure 7.8. [] 

In particular, n is an involution if and only if P(n)=Q[n). That is, the Robinson- 
Schensted-Rnuth algorithm provides a bijection from the set of all w-involu lions 
onto the set of all SYT on n boxes. It is therefore easy to enumerate all these 
SYT. 
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COROLLARY 7.13 

The number of all Standard Young Tableaux on n boxes is equal to 


|n/2j 


X>=E 


\F\~- 


i= 0 


( 2 * - 1 )!!. 


PROOF It is easy to see that the right-hand side is equal to the number of all 
involutions on [n]. Indeed, first choose the 2 i entries that will be parts of 2-cycles 
in ways, then take a fixed point-free involution on them. This latter can be 
done in (2z-l)!! ways as we have seen in Corollary 3.55. 

COROLLARY 7.14 

We have 


2 >• 

n> 0 


Til 


exp 



The bijection g between involutions of length n and SYT on n boxes turns the 
number of fixed points of the involutions to a simple parameter of the 
corresponding SYT. See Problem Plus 2 for that result. The existence of g also 
helps counting involutions avoiding monotone patterns, as illustrated by Problem 
Plus 11. 

There is an interesting, close connection between descents of permutations, 
and descents of the corresponding SYT. Recall that we say that i is a descent of a 
Standard Young Tableau 2fd i appears in a row in f that is strictly above the row 
in which z+1 appears in f. 

THEOREM 7.15 

Let-jx £ S w andleti £ [n — 1]. Then i is a descent of nf and only fi is a descent of Qfn). 

PROOF First assume i £ D (p\ that is, ni>n l+] . We need to show that the 
insertion of n, +] results in the addition of a new box to the P-tableaux that is 
below the box resulting from the insertion of n t . 

As 7tl>7i, +] , we know that n M gets inserted to the first row of P(n) weakly on 
the left of n r If the insertion of k, ended in the first row, then we are done, as n M 
will then have to displace an entry from the first row. 

In any case, the entry a, displaced from the first row by is smaller than the 
entry by displaced from the first row by n r Therefore, even if the insertion of n t 
ends in the second row, that of has to go on to at least one more row. We can 
then repeat this argument for ay and by instead of n,+\ and n„ and the second row 
instead of the first row, and then iterate it for further rows. We then see that the 
insertion of n,+y will always end below that of n v 
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Now assume that i ^ D(p), that is, nf<iz M . Then reversing inequalities in the 
above argument shows that the insertion of 7r I+] will end weakly above that of n t . 
This implies that in Qfji), the entry zTl will be weakly above the entry i, proving 
the second half of our claim. 


7.2 Posets of permutations 

7.2.1 Posets on S„ 

There are various ways to define a partial order on the set of all w-permutations 
for a fixed n, or on the set of all finite permutations for that matter. The first two 
permutation posets that we mention, the Bruhat order and the weak Bruhat order 
are ubiqitous in algebraic combinatorics as they can be generalized from S„ to a 
larger set of groups called Coxeter groups. The interested reader should consult 
[24] for these generalizations, as well as for further information about these 
posets on permutations. 

7.2.1.1 The Bruhat Order 

DEFINITION 7.16 Let P n be the partially ordered set of all nfermutations in which p<q 
fp can be obtainedfrom qby a series of operations, each ofwhich interchanges the two entries 
of an inversion. Then P n is called the Bruhat order on S n . 

For shortness, an operation that interchanges the two entries of an inversion will 
be called a reduction. The Bruhat order is sometimes called the strong Bruhat order 
for reasons that will become obvious shortly. 

Example 7.17 

Figure 7.9 shows the posets P 2 and P 3 . [] 

As the reader probably knows, in a poset, we say that y covers x if xNy, but there 
is no z so that xTz<iy, or visually, when y is “immediately above” x. The reader 
should spend some time justifying some of the covering relations of P 3 , in order 
to become familiarized with this partial order. For instance, why does 231 cover 
132? 

Recall that a poset is called graded if all of its maximal (non-extendible) chains 
have the same length, where the length of a chain is the number of its elements 
minus one. 
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FIGURE 7.9 

The Bruhat orders on 5 2 and S3. 


PROPOSITION 718 

The Bruhat order P n is graded. 

PROOF We claim that if y covers x in P„, then y has exactly one more 
inversion than x. This will obviously imply that all maximal chains of P„ 
have length (£)■ 

Let xNy, and assume y covers x. By Definition 7.16, xNy means that x can be 
obtained fromjy by a series of reductions. However, as x is covered by y, all series 
of reductions that turn y into x must consist of one single reduction. Let that 
reduction be the transposition (yy), where i<j,yfyj, and assume that this reduction 
results in decreasing the number of inversions of y by more than one. That 
means that there is an index k so that /</{</ and yfyiPy,- In that case, however, 
the permutation t=y(yyd would satisfy x<z <y, contradicting the assumption that 
y covers x. | 

In a finite graded poset, the rank of an element is the length of any maximal chain 
ending at that element. It follows from the above proof that rankp n (p)=i(p), where 
i(p) is the number of inversions of p& S„. 

We are going to present a classic result that provides a characterization of the 
Bruhat order. To that end, we need an additional definition. Let p be an n- 
permutation. For each (a, b ) € [n] x [n], we define 

p(a,b) = |{i G [a] so that Pi>b}\. 


2004 by Chapman & Hall/CRC 





Permutations vs. Everything Else. Algebraic Combinatorics of Permutations. 


259 




FIGURE 7.10 

The diagrams of/>=24153 and^‘ 1 =31524. 


Example 7.19 

Let/>=31452, and let (a, b)=( 3, 2). r Th.a\p(a, b)=p(3, 2) is the number of entries 
among the first three entries of p that are at least 2, that is to say, p( 3, 2)=2. 0 


THEOREM 7.20 

Let p and q be two n-permutations. Then pL^ q if and only ifp(a, b)< q(a, b) for all 
(a, b) e [n] x [n]- 

PROOF First we prove that the condition is necessary, that is, iip(a, bf>p(a, b) 
for some (a, b) 6 [n] x [n], then p<q cannot hold. Indeed, note that with any 
reductions, the value q(a, b) never increases. So no series of reductions can turn 
qto p. 

Now we prove that the condition is sufficient. Let a=n-\. Then the condition 
p(n- 1, b)<q(n-l , b) for any b means tha.tp n >q n . I{p„=q n , then our claim is immediate 
by induction on n as we can remove the last entry of both permutations. If pf>q n , 
then there is an entry q, so that z<w and q=p n . In this case, we can apply the 
reduction (q : qj to q, to get a permutation q’thai ends in the same entry as p, and 
for which q<q. A little thought (see Exercise 13) shows that p(a, b)<q’(a, b) holds 
for all (a, b), and the statement again follows by induction on n. 

This characterization of the Bruhat order leads to the following theorem. 

THEOREM 7.21 

Letp and q be two n-permutations. Then p<p n q f and only fp' l <p n cf. 

PROOF As (j) L )~ 1 =p, it suffices to prove one implication. Using Theorem 
7.20, it suffices to prove that if p(a, b)<q(a, b) for all (a, b) (E [n] 2 . then p' l (a, b) 
< q l (a, b) for all (a, b) € [n] 2 - Let us represent permutations with their 
diagrams as shown in Figure 7.10. It is obvious from the definition of inverse 
that the diagram of p' 1 is obtained from that ol /i by reflection through the x=y 
diagonal. 
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Diagram of p Diagram of q 


n-b+1 

A 

n-A-b+l 

n-b+1 

B 

n-B-b+1 

b-l 

a-A 

A+b-a-1 

b-l 

a-B 

B+b-a-1 


a n-a a n-a 


FIGURE7.il 

The number of dots in various rectangles. 


The number p(a, b) is just the number of dots in a certain rectangle in the 
top left corner of the diagram of p. (In fact, that rectangle if of shape (n- 
b+l)xa.) So our condition means that no matter how large a rectangle we take 
in the top left corner of the diagram of p, the number of dots in that rectangle 
is never more than the number of dots in the corresponding rectangle in the 
diagram of q. 

All we need to prove is that the same inequality will hold for the rectangles 
that are in the bottom right corners of the two diagrams. Indeed, taking inverses 
will turn these rectangles into top left corner rectangles. As each row and each 
column of our diagrams contains exacdy one dot, it is easy to compute the 
number of dots in these rectangles, as shown in Figure 7.11. 

Clearly, if A<B, then b-a- 1 +A<b-a- 1 +B, and our theorem is proved. 


7.2.1.2 The Weak Bruhat Order 

There is another partial ordering of permutations that is based on trans¬ 
positions. 

DEFINITION 7 22 Let Pj be the partially ordered set of all n-permutations in which 
p<bq fp can be obtainedfrom q by a series of operations, each of which interchanges two 
consecutive entries thatform an inversion. Then called the weak Bruhat order 
on S n . 

Example 7.23 

Figure 712 shows the posets Pj and Pj. [] 

It is obvious that y covers x in Pj if and only if x can be obtained from y by an 
adjacent transposition, that is, a reduction that interchanges two consecutive entries. 
This implies that the weak Bruhat order is graded, and rank(x)=i(x). 
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FIGURE 7.12 

The weak Bruhat orders on S 2 and S3. 


As P : ' n is graded, all its maximal chains have the same length, namely (”). 
This means that in any maximal chain, the entries i and j are interchanged 

exactly once. 

Now that we know that the maximal chains of P' n are so similar to each other, we 
could ask how many such maximal chains exist. This question turns out to be 
remarkably interesting. It was first answered by Richard Stanley [178], who 
proved his own conjecture showing that the number of maximal chains of P'„ is 
equal to the number of Standard Young Tableaux of shape (n- 1, n- 2, 2,1). The 
proof in [178] uses symmetric functions, so the quest for a combinatorial proof 
has continued. At this point, we mention that a chain connecting permutation p 
to the identity permutation is called a reduced decomposition of p, so the task at hand 
is finding the number of reduced decompositions of w(wT)--21. 

A year later Curtis Greene and Paul Edelman defined a new class of tableaux, 
called balanced tableau, and then found a remarkably simple bijection between 
these tableaux of the same staircase shape and the maximal chains of PC We will 
now give an overview of their results. See [79] for a short summary of all their 
results, and see [80] for the proofs of these results. 

Recall the definition of a hook from Chapter 6. For a box a in a Ferrers shape, let 
4 denote the leg length of a, that is, the number of boxes that are in the same 
column as a, and are weakly below a. 
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FIGURE 7.13 

A balanced tableau. 


DEFINITION 7 24 Let F be a Ferrers shape on m. A balanced tableau of shape F is a 
tableau whose boxes are bijectively filled with the entries of [m] so that for each box a, the content 
of a is the l a th largest entry in the hook H a . 

Note that there is some variation in the literature as to whether the content of a 
should be the /„th largest or / fi th smallest entry in its own hook, but for staircase 
shapes, this causes no confusion. Indeed, the hooklength of a in such a shape is 
always 24 - 1 , so the 4th largest entry of that hook is also its 4th smallest. 

Example 7.25 

Figure 7.13 shows a balanced tableau on nine boxes. Q 

THEOREM 7.26 

The number of maximal chains of the weak Bruhat order P £ is equal to the number of 
balanced tableaux of shape (n- 1, n- 2, 2, 1). 

PROOF We are going to construct a bijcction/from the set MC n of all maximal 
chains of P' n to the set BalSt(n) of balanced tableaux of the staircase shape (n- 1, n- 
2 , - 2 , 1 ). 

Let C € MC n be the maximal chain whose Mi edge corresponds to the 
adjacent transposition interchanging the entries i and j, with i+j. Then we define 
f(C) to be the tableau whose box in position (n+l-i, j) contains the entry k. In 
other words, column j ol f(C) describes the transpositions that moved j and a 
larger entry, while row n+l-i of f(C) describes the transpositions that moved i 
and a smaller entry. Here “describes” means “tells when it happened.” 

Example 7.27 

If n=4, and cis the chain 1234, 2134, 2143, 2413, 4213, 4231, 4321, then f(Cj is 
the balanced tableau shown in Figure 7.14. Q 

We first show that^indeed maps into BalSt(n). Let us look at the box (n+l-i, j). We 
need to prove that its content k is indeed the (i-j) th largest entry in its hook 
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FIGURE 7.14 

The tableau f(C). 


The entries below the box (n+l-i, j) tell us when the interchanges (x, j) took 
place, where x<i. The entries on the right of the box (n+ l-i,j) tell us when the 
interchanges (i,y) took place, W\\h.y>j. Originally, zand j were at distance zj/from 
each other, so there had to be exacdy i-j-1 interchanges of the above types before 
i and j could be interchanged. So the content k of the box (n+ l-i,j) is the (i-j) th 
largest of its hook, as it should be. Therefore f(C) 6 BalSt(n )• 

It is clear that /is an injection. Indeed, if C ^ C\ and the Mi edge of C and C’ 
are different, then the position of k in f(C) will be different from that of k in 

f(C) - 

We still need to prove that/Is a surjection, that is, that for all B t BalSt(n), 
there exists a C E MC n so that f(C)=B. Take B E BalSt(n), and try to find its 
preimage under /. "Then B specifies an order in which we should carry out the (”' ) 
transpositions on the decreasing permutation. What we have to show is that the 
balanced property of B assures that in every step, we will be asked to carry out 
an adjacent transposition. 

In order to prove this statement, we need the following, somewhat surprising 
characterization of balanced tableaux, given by Greene and Edelman. It shows 
that in fact, staircase shaped balanced tableaux are even more “balanced” than 
we would think. Let c(i,j) denote the content of box (i,j) of a given tableau. 

LEMMA 7.28 

[80] Let B be a tableau of staircase shape (n- 1, n- 2, ■■■, 1) whose boxes are bijeciivelyfilled 
with the elements of[ (”) ]. Then B is balanced if and only if, for all (i, j) satisfying i+j<n, 
andfor all s~>i, exactly one of c(s,j) and c(i, n-s+1) is larger than c(i, j). 

To remember the two boxes whose content brackets c(i, j), note that in the first 
one, the second coordinate is fixed, the first is changed to s, in the second one, 
the first coordinate is fixed, the second is changed to n-s+ 1. 

PROOF The “if” part is not surprising. Indeed, Hy-(i, j) can be partitioned into 
disjoint unions of pairs [c(s, j), c(i, n-s+1)}, and if c(i, j) is larger than exacdy one 
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FIGURE 7.15 

The entry c(s, t) leads to a contradiction. 


element from each pair, then it is larger than half of the entries in H,j-(i,j), and so 
B is balanced. 

It is surprising, however, that this seemingly stronger condition is also necessary, 
that is, if B is balanced, then this condition has to hold. Assume not, and assume, 
without loss of generality, that there exists si>i so that c(i, jf>c(s, j), and also, c(i, 
jf>c(i, n-s+1). Then, by the balanced property of B, there exists another index t 
so that c(i, j)<r(t, j), and also, c(i, j)<c(i, n-t+1). Assume, again without loss of 
generality, that £>s. 

Let us now assume that H tl is a minimal counterexample to our statement (It is 
easy to check that in that case, we must have hf> 5.) The reader is invited to 
follow our argument in Figure 7.15. For shortness, in this figure we set x’=n-x+ 1, 
and also, we marked entries known to be larger than c(i, j) by black dots, and 
entries known to be smaller than c(i, j) by gray dots. 

As we assumed that H t j was a minimal counterexample, our statement must 
hold for H s j, and H lit -. Therefore, because of H s j, we must have 

c(s, t)<£(s,j)<c(i,j). 

On the other hand, because of H(i, t), we must have 

c(s, t’f>c(i, t’f>c(i,j). 

As the last two chains of inequalities clearly contradict each other, our lemma is 
proved. | 

Now we can return to the proof of the suijectivity of f Let B € BalSt(n), and 
let us start building up a maximal chain Cby decoding B. That is, in step m, let us 
interchange the entries i and j, where m=c(n-i+l,j in B. We must show that each 
step will actually define an adjacent transposition. Assume not, and let k be a 
minimal counterexample. That is, assume k asks us to interchange x and y, so 
that x<Ly, but x andy are not in consecutive positions. 
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First we show that if x<z<y, then z cannot still be between x and y. Indeed, 
look at H n+ i_ yx . This hook has k written in its peak, and its length is 2(y-x)-l. 
As B is balanced, there arc y-x- 1 entries in H n+l _ y x that are less than k. This 
means that before step k, there have been y-x- 1 interchanges of the types (x, 
z), with x<z, and (z, y), with z <y. So no z € {x,y) could be still located 
between x and y. Now let iKxfy, and assume that v has been interchanged 
with x at some point before step k. This means that c(n+l-x, v)<k. However, 
by Lemma 7.28, c(n+Tx, v) has to be larger than exactly one of c(n+l-y, v) 
and c(n+l-x,y)=k. Because of the previous sentence, the only way for this to 
happen is c(n+l-y, v)<.c(n+ 1-x, v)<k. So in particular, c(n+Ty, v)<.k, meaning 
that y and v were also interchanged before step k, so v is not located 
between x and jy. 

Finally, an analogous argument shows that if x<jy<H, and u and y were 
interchanged before step k, then so were u and x. Therefore, no element can 
be located between x and y, so x and y are adjacent as claimed. 
Consequently, the chain C can always be built up so that /(<:)= B, and our 
proof is complete. 

We have mentioned that Stanley [178] has proved that the number of all 
maximal chains in the weak Bruhat order of S„ is the number of all Standard 
Young Tableaux of shape (n- 1, n- 2, ■■■, 2, 1). Now we have seen the proof of 
Edelman and Greene showing that this number is also equal to the number of 
balanced tableaux of shape (n-1, n- 2, •••, 2, 1). This certainly means that there are 
as many balanced tableaux of this shape as SYT of this shape. It turns out that a 
much more general statement is true. 

THEOREM 7.29 

[80] Let F be any Ferrers shape, and let If be the number fall balanced tableaux ofshape F. 
Then 


b F =f. 

In other words, there are as many balanced tableaux of any given shape as there 
are SYT of that same shape. See [80] for a proof of this theorem. The proof 
follows from some sophisticated bijections. There are some sporadic cases in 
which the bijection is simple, but in the general case it is not. 

Theorem 7.29 shows that the number of balanced tableaux of a given shape is 
also given by the hooklength formula. It would be interesting to find a 
probabilistic proof of this fact. 


7.2.2 Posets on Pattern Avoiding Permutations 

Let P A be the partially ordered set of 132-avoiding w-permutations ordered by 
strict containment of the descent sets. That is, in P A we have p<q if D(p) C D(q). 
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It is then clear that is ranked, and we know from Problem Plus 1 of 
Chapter 4 that there are T (f) ( A .” 1 ) elements of rank k in P'f In particular, this 
means that Pf has as many elements of rank i as of rank n-l-i, in other words, 
Pf is rank-symmetric. 

There is a much deeper notion of symmetry in posets. 


DEFIMTlOJV 7.30 We say that a poset P is self-dual f it has an antiautomorphism, that 
is, f there exists a bijectionf :P—>P so that p< P q f and only tff(q)<f(p). 


A bijection^having the property required in the above definition is often called 
order-reversingbecause of what it does. 


Example 7.31 

For any positive integer n, the Boolean algebra B„ of all subsets of [n], ordered by 
containment, is self-dual. 

PROOF Let / be the bijection that maps each subset of [n] to its complement. 
Then/ is clearly order-reversing, proving our claim. 

THEOREM 7.32 

The poset pf is self-dual. 

This result was first announced in [36], but its proof contained a little oversight. 
That was corrected in [27]. The proof we present here is significandy simpler 
than the previous proof. 


PROOF Let p £ pf . Recall Exercise 27 of Chapter 4. Define f(p) to be the 132- 
avoiding permutation whose unlabeled binary tree is obtained from the unlabeled 
binary tree T(p) of/> by reflecting T(p) through a vertical axis. By part (b) of the 
mentioned exercise, this reflection will turn left edges into right edges, so ascents 
into descents, and vice versa. In particular, the vertex that was in the ith position 
from the left will now be in the ith position from the right. Therefore, by part (c) 
of the mentioned exercise, if pi>P M , then we will have f(p„.) < fp n + i-i)- So i £ D(p) 
if and only if n - if D{f{p)) for i £ [n — 1]. 

In other words, the descent set of f(p) is precisely the reverse complement of the 
descent set of p, implying that yds order-reversing. 

Let be the poset of 321-avoiding w-permutations ordered by strict containment 
of the set of excedances. 
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PROPOSITION 7.33 

The posets pfi and Qf are isomorphic. 

PROOF For each S C [n — 1], let iff 21 (S) be the set of 321-avoiding n- 
permutations with excedance set S C [n — 1]. 

Let also Dlf 2 (a{S)) be the set of 132-avoiding w-permutations with descent 

set equal to a(S), the reverse-complement of S. 

We construct a bijection s:E^ 21 (S) —> D* 32 (a(S)) illustrated by Example 

7.34. If p g E 321 (S), then, as seen earlier in the definition of 0, the entries /■ 
with j (j: S form an increasing subsequence. This, and the definition of excedance 
imply that pj is a right-to-lefi minimum (that is, smaller than all entries on its right) if 
and only if j £ Exc(p) = 5. 

Now let p ’=p,p n --[ ■ ■ -p\ be the reverse of p. ThenjCis a 123-avoiding permutation 
having a left-to-right minimum at position i<n exactly if w+ 1 — i ^ S- 

Recall from the proof of Lemma 4.3 that there is exactly one 132-avoiding 
permutation p" which has the same set of left-to-right minima and has them at 
these same positions. Namely, p' is obtained by keeping the left-to-right minima 
o{p 'fixed, and successively placing in the remaining positions, from left to right, 
the smallest available element which does not alter the left-toright minima. We 
set s(p)=p". From the proof of Theorem 7.32 we see that i g Des(p") if and only 

if n — i S for j g [n — 1], in other words, when i g a (5), and so/'belongs 
indeed to D^ 2 (a(S))- 

It is easy to see that s is invertible. Clearly, p ’ can be recovered from p" as the 
only 123-avoiding permutation with the same values and positions of its left-to- 
right minima as p". (All entries which are not left-to-right minima are to be written 
in decreasing order). Then p can be recovered as the reverse of P’. 

Thebijectionss:Ti^ 21 (5) Lti 32 (o:(S))for all choices of S C [n— l]produce 

an order-reversing bijection from Q * to Pf. But pf is self-dual, so the proof is 
complete. 

Example 7.34 

Take p = 341629 5 10 78g Ef^(S) for S = {1,2,4,6,8}. Then its 
reversalp—8 7105926143 has left-to-right minima 8, 7, 5, 2,1 in positions 1, 2, 4, 6, 8. 
We obtain s(p)=p"= 8 79562314 10, a permutation in D{q 2 ({1, 3,5,8})- [] 


7.2.3 An Infinite Poset of Permutations 

Let Pbe the poset of all finite permutations ordered by pattern containment. That 
is, in this poset, p<q if and only if p is contained in q as a pattern. This means that 
the closed classes of Chapter 5 are precisely the ideals of P. 
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We have seen in Chapters 4 and 5 that if we want to find permutations that 
avoid all of the patterns contained in a given set S, then our task is getting 
progressively harder if new elements are being added to S. It is therefore 
reasonable to ask whether this task will eventually become impossible. That is, 
let JVbe an arbitrary positive integer. Is it possible to find M permutations so that 
none of them contains any other as a pattern? Or, even more strongly, is it 
possible to find an infinite antichain in the poset P? 

This question was attacked, and the affirmative answer discovered and 
rediscovered several times, during the last third of the twentieth century. Here 
we present a construction that may be chronologically the first. The 
construction (without proof of the antichain property) was published by Taijan 
in [188] in 1972. 

THEOREM 7.35 

The poset P contains an infinite antkhain. 

The above result could be reformulated by saying that P is not a well-quasi 
ordering. 


PROOF Let ri>2, and let 

p n =2(fin-l) 4 1 6 3 8 5—(4w-2) (4w-5) (4 n) (4w-3). 

In other words, p n has 4 n entries, and consists of two parts, the increasing 
subsequence 246-4w in the odd positions, and the odd entries in the even 
positions in increasing order, except for 4w-l, which is moved up into the second 
position. We claim that the p„ form an infinite antichain. 

Assume the contrary, that is, that p t <p n for some k<n. Which entries of p„ 
could play the roles of the entries of pfi The role of 4T1 has to be played by 4 n- 

1, otherwise we could only find at most two smaller entries on its right. 
Therefore, the entry 2 of p n must play the role of the entry 2 of p k . This, however, 
totally ties our hand in making the remaining 4/c2 selections. Indeed, the entry 
1 of p n must be chosen to play the role of 1 as that is the only entry smaller than 

2, therefore the entry 4 of p n has to be chosen to play the role of 4 as that is the 
only entry not yet selected that precedes 1. These forced selections continue, 
and we have to choose the leftmost 4/c2 entries of/;,, to play the roles of the first 
4T2 entries of p n . Then we must choose 4 n to be the next-to-last entry as that is 
the only entry larger then 4w-l, forcing us to choose 4w-3 to play the role of the 
last entry of p k . However, this last entry is too large. Indeed, it is larger than the 
entry 4/;-2 that we choose when we selected the (4/;-3)th entry of our purported 
copy of p k . This is a contradiction, as in p k the last entry is smaller than the entry 
in position 4A-3. | 
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7.3 Simplicial Complexes of permutations 

We have defined simplicial complexes in Problem Plus 1 of Chapter 1, but we 
repeat that definition for easy reference. 


DEFINITION 7.36 A simplicial complex A is afamily of subsets of an underlying set 
S so that ifF E A and G C F then G g A. 

In other words, a simplicial complex is an ideal of the Boolean algebra with 
underlying set S. The sets that belong to the collection A are called the faces of A. 
If S 6 A has i elements, then we call S an (z-1)-dimensional face. The dimension 
of A is, by definition, the dimension of its maximal faces. 


Example 7.37 

Let Pbe a finite partially ordered set. Then the collection of all chains in P forms 
a simplicial complex, called the chain complex of P. 

Indeed, every induced subposet of a chain is a chain. 

If we can prove that certain objects, say w-permutations with /{-descents, are in 
bijection with /{-element faces of a given simplicial complex, that can have 
additional algebraic significance. In fact, additional algebraic interpretations can 
be found for the numbers enumerating our objects. The algebraically inclined 
reader is invited to consult [46] for details. 


7.3.1 A Simplicial Complex of Restricted Permutations 

As we have mentioned, and in some cases, shown, there are many objects 
enumerated by the Catalan numbers. Among all of these, we now choose 231- 
avoiding permutations, and show how they form a simplicial complex. The reason 
for this choice is that 231-avoiding permutations fit well in a more general class 
of permutations, called /-stack sortable permutations, to which we will return in 
the next chapter. The following theorem explains what we mean when we say 
that the set of these permutations form a simplicial complex. 

THEOREM 7.38 

There exists a simplicial complex (with an underlying set off) elements) whose ( k-1) -dimensional 
faces, that is, k-element faces, are in bijection with 231-avoiding n-permutations having k 
ascents. 


PROOF We have seen in Exercise 17 of Chapter 4 that 231-avoiding 
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FIGURE 7.16 

Each subset of S defines a lattice path. 


w-permutations are in bijection with northeastern lattice paths from (0, 0) to (n, n) 
that never go above the main diagonal. We have also seen in that same exercise 
that this bijection turns the ascents of the permutations into north-to-east turns 
of lattice paths. 

Now let A be the simplicial complex of all sets S of points for which there 
exists a northeastern lattice path r so that the set of all north-to-east turns of r is 
equal to S. (Of course, we still have to show that A is indeed a simplicial 
complex.) It is clear that given S, we can recover r. So the faces of this simplicial 
complex A are indeed in bijection with 231-avoiding w-permutations. The 
previous paragraph shows why (AT)-dimensional faces correspond to 
permutations with k ascents. 

Finally, we prove that A is indeed a simplicial complex. Let g g /y Then 
there exists a subdiagonal northeastern lattice path r so that the set of northto- 
east turns of r is equal to S. That is possible if and only if the points of S form a 
chain in N 2 (in the natural cordinate-wise ordering), are between certain limits, 
and have all different horizontal and vertical coordinates. However, if that is the 
case, then that must be true for all subsets T C S, implying that there is a 
northeastern lattice path t whose set of north-to-east turns is 7. That means that 
T £ A' an d the proof is complete. | 


Example 7.39 

Let n= 4, and let 6={(2, 1), (3, 2)}. Then the northeastern lattice paths whose set 
of north-to-east turns is S, or a subset of S are shown in Figure 7.16. [] 
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7.3.2 A Simplicial Complex of All m-P ermutations 

There are several ways to define a simplicial complex whose ^-element faces are 
in bijection with n-permutations having k descents. One of these, [103], was 
mentioned in the solution of Problem Plus 1 of Chapter 1, and another one can 
be found in [81]. Here we present such a simplicial complex based on the bijective 
representation of permutations by labeled lattice paths given in Lemma 1.28. 

Let p = p\p 2 * * 'pn be an w-permutation having k descents, and let us say that 
D(p)={d h df Then by Lemma 1.28, we can represent/; by a northeastern 
lattice path F(p) consisting of k vertical and n-k horizontal edges, where the edges 
are labeled according to certain rules explained after Theorem 1.26. For easy 
reference, here are the rules again. 

Let the edges of F(p) be denoted «i, « 2 , —, a m and let e, be the label of a,. Then 
the following has to hold. 

(i) The edge al is horizontal and c,=l, 

(ii) if the edges a, and a i+1 are both vertical, or both horizontal, then e>e M , 

(iii) if «, and a M are perpendicular to each other, then e,+e l+ i<i+\. 

The set of labeled lattice paths of length n satisfying these conditions is 
denoted V(n). 

We are going to decompose F(p) into a Ttuple of northeastern lattice paths, 
(F<\>, F<2>, F< a>) , each of which will consist of one vertical step and n- 1 horizontal 
steps. The unique vertical step of F <t> will be in the same position as the ith 
vertical step of F(p). In other words, the unique vertical edge of F< j> will be its 
(T +I )st, corresponding to the zth descent of/;. We still have to specify the labels of 
the edges of F <t> . Let e ht be the label of the /ill edge of F <;> . We then set 


1 if i < di, 

6j if di + l < j < di+i, 

s 1 if d i+ 1 + 1 < j. 


It is straightforward to check that this is a valid definition, that is, rules (i)—(iii) 
are satisfied. 


Example 7.40 

Figure 7.17 shows how we decompose F(p) if/;=612435. [] 

Now let A be the family of Ttuples {F <x> , ifa>, —, F <k> ) that can be obtained from 
^permutations the way described above, for some k. Note that instead of saying 
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FIGURE 7.17 

Decomposing F(p) into {F <c> , F<2>). 


F 


< 2 > 


“Ftuples,” we might as well say “sets” as the order of the F <;> within each such set 
is completely determined by the position of the single vertical edge in each 

THEOREM 7.41 

The collection of sets A defined in the previous paragraph is a simplicial complex. 


PROOF The proof is very similar to that of Theorem 7.38. Let F'= (F <r> , F <2> , 
■■■, F <Jt> ) be a Ftuple of lattice paths from P(n) having one single vertical edge 
each. Let the vertical edge of F<£> be the bfia edge of F<j>. Then F' € A if and 
only if bi<bpC-<.b k . If F’ has this property, then obviously so do all its subwords 
(subsets), so F' G A implies F" £ A, for all F" C F'- | 


Exercises 

1. Prove that 7„(123-"A)<(F1)”. 

2. Find a bijection from the set of 321-avoiding permutations with k ex- 
cedances to the set of noncrossing partitions of [n] having k+\ blocks. 
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3. Prove, without the use of the hooklength formula, that the number of 
Standard Young Tableaux of shape 2xn is C„. 

4. What is the average number of descents of a 2 xn Standard Young 


Tableau? 


5. Let k be a permutation of length 80, that avoids the pattern 12---9, and 
assume that P(k) has ten rows. How long is the fifth row of P(n)? 

6. Find a formula for the number of northeastern lattice paths from (0, 0) to 
(n, n) that never go above the main diagonal and are symmetric about the 
diagonal x+y=n. 

1. Prove that/ n (123) = (^ n ” 2 j). 

8. Prove that /„( 123•••/:) is Precursive. 

9. Let D kr (n) denote the number of n-permutations p in which the longest 
increasing subsequences have k elements and for which r is the largest 
natural number so that there exist r disjoint increasing subsequences of 
maximum size in p. Prove that then D kr (n) is a Precursive function of n. 

10. Let k be a permutation so that the first row of P(k) is of length a, and the 
second row of P(k) is of length b. Is it true that n has two disjoint increasing 
subsequences Si and s 2 so that .j, has length a and s 2 has length b ? 

11. Prove that 



F> 


where F’ ranges over the set of Ferrers shapes that can be obtained from F 
by omitting a box (which is necessarily an inner corner). 

12. Let n be any permutation other than the increasing or decreasing one. 
Find a simple way to create another permutation a so that P(n)=P(o). 

13. Complete the proof of Theorem 7.20 by showing that p(a, b)<q’(a, b) for 


all (a, b ) e [n} 2 - 


14. For what positive integers n is the number of involutions of length n even? 

15. Is there an infinite antichain in the poset Pof finite permutations ordered 
by pattern containment that consists of 123-avoiding permutations? 
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16. Keeping in mind that the reverse or the complement of an involution is not 
necessarily an involution, prove that nevertheless, for all positive integers 
n, we have 

I n ( 123-A)=/.(A-321). 

17. We can view a permutation p as a poset P p as follows. The elements of P p 
are the entries oi p, and x<i P y if (x, y) is a non-inversion of p. Prove that P p 
and Pp- 1 are isomorphic. 

18. Let k be any positive integer. For some n, find k different w-permutations P, 
so that all the P p . are isomorphic to one another. 

19. Prove Proposition 7.10. 

20. Let us consider all n\ vectors that are obtained by permuting the coordinates 
of the w-dimensional vector 

( 1 \ 

2 

\ n J 

Denote by n„ the convex hull of all of these vectors in R' 1 , that is, the smallest 
convex set that contains all of these vectors. It is clear that n„ is a polyhedron, 
and therefore, it is often called the permutahedron. How many edges does n n 
have? 

21. Prove that 

2fc - n + 1 (n + 1\ \ 2 _ i ff) 
n + 1 \k + l) ) n + 1 ’ 

22. +Prove that the number of dissections of a convex (w+2)-gon by d 
nonintersecting diagonals is equal to the number of Standard Young 
Tableaux of shape (d+ 1, d+ 1, 1, 1, •••, 1), where the number of rows of 
length 1 is n-l-d. 

23. Complete the proof of Theorem 7.4 by showing that the hooklength formula 
indeed implies (7.1). 

24. Recall the definition of a crossing in a set partition from Problem Plus 4 of 
Chapter 5. Let us call a partition noncrossing if it has no crossings. 

(a) Define a bijection from the set of noncrossing partitions of [n] having 
k blocks to the set of 132-avoiding w-permutations having k-i descents. 


E 


fc=L(n+l)/2j 
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(b) Define the partially ordered set NC(n) of noncrossing partitions of [n] 
by refinement. That is, TcA NC (n) 7r’if all blocks of n’ are unions of 
blocks of n. Compare this poset to the poset P A defined in Section 
7.2.2. 

25. Recall that a partially ordered set Pis called a lattice it for any two elements 
x.y 6 P, the set 


{z E P\x < x and y < z} 
has a (unique) minimum element x V y, and the set 
{u E P\x > u and y > u} 
has a (unique) maximum element x A y. 

Is the weak Bruhat order P ! :l a lattice? 

26. Prove that there exists a bijection / : P’’ so that x A f{x) =123 ■■■n 

and x V f(x) = n(n — 1) • • • 21. See the previous exercise for the relevant 
definitions. 

27. A finite lattice L is called complemented if for any element x £ E there exists 
a unique element y E L so that x A y = 0 and x V y = 1, where 0 denotes 
the minimum element of L, and 1 denotes the maximum element of L. 

Is I f a complemented lattice? 

28. Let /„ be the induced subposet of the (strong) Bruhat order P„ whose 
elements are the involutions of length n. Is /„ a lattice? 

29. Let I n be defined as in the previous exercise. Is /„ self-dual? 

30. The dimension of the poset Pis the smallest positive integer d so that Pis the 
intersection of d total orderings. What is the dimension of the poset P p 
defined in Exercise 17? 

31. +Find an asymptotic formula for the number of 2-dimensional posets on n 
labeled elements. 

32. We have defined the permutahedron in Exercise 20. Clearly, n„ is in fact 
an (w-1)-dimensional polyhedron as all its vertices lie within the hyperplane 
£”_i Xi = ("+ 1 ). So in particular, 7 t 4 is three-dimensional. Do all faces of 

have the same number of edges? 

33. Isttp be a randomly selected involution of length n. Let i £ [n — 1]. What 
is the probability that i is a descent of fP 

34. Decide if the following statements are true or false. In all three statements, 
p and q are two w-permutations, and their respective sets of inversions are 
I(p) and I(q). 


2004 by Chapman & Hall/CRC 


276 


Combinatorics of Permutations 


(a) If I(p)<I(q), then p<q in the Bruhat order. 

(b) If p<q in the Bruhat order, then I(p)<I(q). 

(c) If p<q in the weak Bruhat order, then I(p)<I(q). 


Problems Plus 

1. Let D k (n) be the number of w-permutations in which the longest increasing 
subsequences have k elements and they all have at least one element in 
common. Prove that D k (n) is a /'’-recursive function of n. 

2. We have seen in Theorem 7.11 that the Robinson-Schensted-Knuth 
correspondence naturally defines a bijection between inversions of length 
n and SYT of size n. Lct / bc this bijection, and assume that we are told the 
shape of f(p) (so not the content of each box). How can we figure out the 
number of fixed points of p from this information? 

3. What is the number of 321-avoiding fixed point-free involutions of length 
2 n? 

4. What is the number of 123-avoiding fixed point-free involutions of length 
2 n? 

5. Prove that the number of 123-avoiding involutions of length 2w+l having 
exactly one fixed point is 

/<' , (123)= ( 2 " n +1 ). 

6. Let /W ^ denote the number of involutions of length n having exactly k 
fixed points. Prove that 


/W(321) = 4 fc )(132) = 4 fc )(231) 


£+l((n-t)/ 2 ) for n + k even, 
0 for n + k odd. 


7. Let P be a poset having n elements, and let a { denote the length of its 
longest chain. Let be defined so that the largest number of elements that 
the union of two chains can contain in Pis ap-Oi. Similarly, for i> 2, let a, be 
defined so that the largest number of elements that the union of i chains 
can contain in P is ap-aprai. Continue defining the a, as long as they are 
positive. 
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(a) Prove that a 1 >a 2 >--->a i , where a k is the last a, that has been defined (in 
other words, ff { a } = n) ■ Therefore, there exists a Ferrers shape 

F=(a h <%, ■■■, a t ). Let b , be the length of column i of F. Prove that any 
positive integer j, the sum bp-bprbj is equal to the largest number of 
elements that can be covered by j antklmins in P. 

(b) Why is the result of part (a) a generalization of Theorem 7.5? 

8. Let us try to generalize the result of the previous Problem Plus as follows. 

Let G be a graph on n vertices, and let a, be defined so that apt-ap -t-«, is 

equal to the largest number of vertices that can be contained in the union 
of i cliques (complete subgraphs) in G. Again, define the a, as long as they 
are positive. 

(a) Is it true that a{>ap>—>ap 

(b) Assuming that the answer to the question of part (a) is yes, define b, 
as in part (b) of the previous Problem Plus. Is it true that the sum 

bp-bp - Ybj is equal to the largest number of elements that can be 

contained in j antidiques (independent sets of points, in other words, 
empty subgraphs) in G? 

9. For what class of graphs will statements (a) and (b) of the previous Problem 
Plus follow directly from the result of Theorem 7.5? 

10. Find a formula for 7„(1234). 

11. (a) Find a formula for 7,(12345). 

(b)Find a formula for 7„(123456). 

12. Prove that for all positive integers k and n, we have 

(Yj S„(12 * • • fc + 1) = ) (—l) r 7 r (12 • • • k + l)hn-r(12 ■ ■ ■ k+1). 

13. Let F(2n) be the induced subposet of the Bruhat order P 2n whose elements 
are fixed point-free involutions of length 2 n. 

The rank-generating function of a finite graded poset P is the polynomial 
Wp(x) = J2 k i-o where r, is the number of elements of T’that are of 

rank i. Prove that 


^2„,=[l][3]-[2n-l]. 


14. Find an asymptotic formula for the number of isomorphy classes of 
2dimensional posets on n elements. Here [m]=l +q+q ml . 
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Solutions to Problems Plus 

1. We claim that our condition that all maximum-length increasing 
subsequences have a common element is equivalent to the seemingly weaker 
condition that any two maximum-length increasing subsequences intersect. 
If we can prove this, then our problem will be reduced to the special case 
r= 1 of Exercise 9. 

To prove our claim, assume that p is a permutation in which any two 
increasing subsequences of maximum length k intersect. We construct a 
directed graph G p associated to p. The vertices of G p are the entries of p and 
there is an edge from the entry i to the entry j if and only if i=j and i is on 
the left of j. So an increasing subsequence of length k in p corresponds to a 
directed path of length k in G p . Now let us remove all edges not in any 
maximum-length-path from G p , and add a “source” .s and a “sink” t to get 
the graph G'. That is, s and t are vertices so that s has indegree zero, and 

there is an edge from s to all left-to-right minima of p, while / has outdegree 
zero and there is an edge to t from all right-to-left maxima of p. So each 
increasing subsequence of size k corresponds to an j^^path of maximum 
size in a natural way. Now suppose these directed paths of maximum length 
do not have a vertex in common. Then we can delete any vertex v and still 
have an s—>t path in G' p . In other words, G' p is 2 —(s, ^-connected, which 

implies, by the famous theorem of Menger (see for example [146]) that 
there are at least two vertex-disjoint s—>t paths in G'. This is equivalent to 

saying that there are two increasing subsequences of size k in p which are 
disjoint, which is a contradiction and the proof is complete. 

2. The number of fixed points of^ is equal to the number of columns of odd 
length in f(p). See [23] for a proof of this fact. 

3. We have seen in the previous Problem Plus that the number of fixed points 
of an involution equals the number of odd columns of its P-tableau. 
Therefore, a fixed point-free involution has no odd columns. If, in addition, 
such an involution avoids 321, then its P-tableau cannot have columns 
longer than 2. This implies the shape of this P-tableau must be 2xw. As an 
involution is completely determined by its P:ableau, we conclude from 
Exercise 3 that the number of such SYT is C n . 

4. Let us first try to proceed as in the previous Problem Plus. In any case, the 
rows of the P-tableau of such an involution must be of length at most two, 
so the P-tableau has two columns. Because of the fixed point-free criterion, 
odd columns are not allowed. So (except for the trivial, one-column case), 
the P-tableau will have two columns, of length 2 (n-k), and 2k, where k<n!2. 
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It is then routine to compute by the hooklength formula that the number 
of these tableaux, and therefore, the number of fixed point-free 123-avoiding 
involutions of length 2 n is 



2n - 4fc + 1 
2n — 2 k + 1 


This is not a particularly simple formula. 

However, it has recently been proved in [75] that in fact, the number of 
these involutions is ( 2n n ~ 1 )- The proof uses Injections to certain lattice paths 
called Dyck paths. It follows from the solution of Exercise 7 that 
hn+i (q) = ( 2 ” n h1 )- As an 123-avoiding permutation has to have exactly 
one fixed point, the result follows. 

5. It is clear that jW (132) = I„ (231) f° r an y n and k as 231 is the reverse 
complement of 132. The enumerative formula for 132 is proved in [119], 
and the Injection between the sets enumerated by J W (132) and J(321) 
is given in [75]. 

6. (a) This is the famous Greene-Kleitman-Fomin theorem, which has 

several proofs. The two earliest ones are [114] and [92]. A different 
proof, using the concept of orthogonal families of chains and antichains, 
was given in [94], 

(b) For any permutation p, we can take its permutation poset P p . The 
chains of P p will be in bijection with the increasing subsequences of 
p, and the antichains of P p will be in bijection with the decreasing 
subsequences of p. Note that in this special case, the proof of our 
claim follows from Theorem 7.7. 

7. (a) No, this is not true. See Figure 7.18 for a counterexample. It is easy 

to see that for the graph shown in that figure, we have a, =3, <22=1, 
and tf 3 = 2 . 

(b) This is not true either. See Figure 7.19 for a counterexample. One 
checks easily that « 1 =fl 2 =2, and 0^=1. This would have to imply ^=3, 
but there is no anticlique of size three in this graph. 

8. The class of graphs we are looking for is that of comparability graphs of posets. 
If P is a poset, then its comparability graph G(P) is the graph whose vertex 
set is the set of elements of P, and two vertices are connected by an edge if 
the corresponding elements of Pare comparable. Then the cliques of G(P) 
correspond to the chains of P, and the anticliques of G(P) correspond to the 
antichains of P. 
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FIGURE 7.18 

A counterexample. 



FIGURE 7.19 

A counterexample. 
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9. The numbers /„(1234) are equal to the Motzkin numbers, that is, 

n/2 . . 

/ n (1234) = X> - 

i=0 ' ' 

This result was first proved in [161], who uses symmetric functions in his 
argument. A simpler proof is given in [180], Exercise 7.16.b, but that proof 
still uses symmetric functions. In recent years, there are a plethora . of 
results on the subject, that together yield that /„(1234)=M„, and do not use 
symmetric functions. In fact, it is known that 

M=I n (2143) =/„ (1243) =I„ (1234). 

The first of the above three equalities was bijectively proved in [121], the 
second one was bijectively proved in [120], and the third one follows from 
a results proved in [127]. That result is I n (12q)=I„(21q), for any pattern q 
taken on the set (3, 4, •••, k). Therefore, 

/„(1243)=/„(2134)=/„(1234), 

where the first equality is true because if/; is an involution, then so is the 
reverse complement of p. 

We point out that [127] contains the even stronger result that I n {\2'iq)= 
/„(321^), where q is any pattern taken on the set {4, 5, k}. It would be 

interesting to find a more direct, combinatorial proof. 

10. (a) The number 7„(12345) is in fact the number of Standard Young Tableaux 
on n boxes having at most four columns. It is proved in [111] that 

{ Cl if n = 2 k is even, 

CkCk+i if n is odd. 


(b)It is proved in [111] that 


Ln/2j 

7 n (123456) = 6 V 

i =o 


Ci 


(2i + 2)! 

(i + 2)!(* + 3)! 


This is the largest k for which an exact formula is known for /„(12-T). 

11. This result is due to H.Wilf, who gave a generating function proof in 
[200]. Such an elegant result certainly asks for a direct bijective proof, but 
none is known to this day. 

12. This result is proved in [68] where the authors define an interesting notion 
of weight on fixed point-free involutions. 

13. It is proved in [202] that the number of these classes is (l+o(l))w!/2. The 
result was also proved by El-Zahar and Sauer. 
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8.1 Generating Permutations 

8.1.1 Generating All w-permutations 

If we want to write a computer program to test a conjecture concerning 
permutations, we need to have an efficient method to generate all //-permutations 
for our machine. If the conjecture only concerns permutations with some 
restrictions, we can save a lot of time and effort by having a fast way to generate 
only those permutations with the required property. 

One can certainly list all permutations lexicographically. That is, let us define 
a partial order on the set of all n-permutations as follows. Letp=p i p 2 '"pn < fi c l 2 '" c ln 
if for the smallest index i for which p t ¥^q b we have Pfcy. The total order defined 
on S n is called the lexicographic order. So for instance, 34152<35412 since the 
smallest index for which p { and q { are different is i= 2 , and pft-q 2 - 

It is obvious that the smallest element of S„ in the lexicographic order is 
the identity permutation 1 2--n. Therefore, in order to construct an algorithm 
to list all w-permutations in the lexicographical order, it suffices to have a 
method to find the permutation immediately following a given permutation p in 
the lexicographic order. Such a method is provided by the following proposition. 
Recall that in a poset we say that q covers p if p<q, and there is no r so that 
p<dNq. 

PROPOSITION 8.1 

Let p=p\pr"pn he a permutation. Let i be the largest ascent of p. Then t!u permutation q 
covering p in the lexicographical order is given by q= P\Pr"P, \q t q,+vqn, where q t is the 
smallest element in {p l+ i, pn. 2 r"P n that is larger thanp i} and tlw string q M ---q n contains the 
remaining entries ofppi+yp n in increasing order. 

Example 8.2 

If ^>=2415763, then r=4, and ^=2416357. G 
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PROOF Let q be as above, then p<q as pf q r Now assume there exists an n- 
permutation /=r 1 r 2 ---r„ that so that pr'CrCq. Then r 1 r 2 -"r ! : ] = pipvpi-i must hold, 
otherwise one of p<r and rCq could not be tme. As we must have p,—r t ^q u 
and q { is the smallest remaining integer larger than p,, we must have either 
pr=Ti, or r=q v Both are impossible, however. Indeed, in the first case, we could 
not have p<r, as the rest of p is decreasing, and in the second case, we could 
not have r<q, as the rest of q is increasing. | 

Proposition 8.1 provides an obvious algorithm that generates all permutations of 
length n, one by one, in the lexicographic order. The algorithm will stop when the 
index i, that is, the largest descent of p, cannot be found; that is, when p=n(n- 
1)—1, the maximum element of our total order, cannot be found. 


8.1.2 Generating Restricted Permutations 

A huge amount of research has been done on objects counted by the Catalan 
numbers. These objects, of which R.Stanley lists over 150 different kinds in 
[180] have been enumerated according to various statistics, leading to interesting 
open problems. It is therefore desirable to be able to generate these objects 
efficiendy. We will show how to do this with 231-avoiding permutations. 

First, we define a total order H„ on the set of all 231-avoiding n-permutations 
as follows. Let p=LnR and q=L’nR ’be two w-permutations, where L denotes the 
string on the left of n in p, and R denotes the string on the right of n in p, and L’ 
and R ’are defined analogously for q. Note that L and R could be empty. Now let 
|Z] denote the length of L. We say that p—H„ q if 

(a) \L\>\L\ or 

(b) |Zj=|Z,j, and L<Z7in H^, or 

(c) f\=f’\, and L=L’, and R<R’in H\ R \. 


Example 8.3 

Let n=8. Then ^=21348576 ^32184567=^ as L is of length four, while L’ is of 
length three, so rule (a) applies. Similarly, if /=32148567, then p<r as 2134<3214, 
by the repeated application of rule (b) (213 <321). Finally, if j= 21348765, then 
p<s by rule (c). Q 


Example 8.4 

For rt= 4, the list of all elements of H„ in increasing order is 1234, 2134, 1324, 
3124, 3214, 1243, 2143, 1423, 1432, 4123, 4213, 4132, 4312, 4321. □ 
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It is straightforward to see that H„ is indeed a total order, (as any two elements 
are comparable) with minimum element 12--w and maximum element n— 21. 
Note that the order H„ is quite “left-heavy”, that is, a little change made at the 
beginning of a permutation can influence the rank of the permutation much 
more than a little change at the end of the permutation. 

This observation suggests that we try to locate the unique element q covering 
p=LnR in H n by the following algorithm. 

Start with p=LnR. We will describe how to transform certain parts oip. Parts 
that are not mentioned in a given step are left unchanged by that step. We point 
out that a 0-element and 1-element strings are considered monotone decreasing. 

(i) If R is not monotone decreasing, then replace p by R, redefine L and R, 
and go to (i) again. 

(ii) If R is monotone decreasing, but L is not, then replace p by L, redefine L 
and R, and go to (i) again. 

H If L and R are both monotone decreasing, then move the first (leftmost) 
entry of L to the position immediately after the maximal entry of p. 
Order L and R increasingly. Then we put our strings back together, and 
get permutation q. 

The above algorithm will output a permutation for all inputs pAn(n-{)--{. 

Example 8.5 

• Let p>= 21348576. Then T=2134, and R=576. So R is not monotone 
decreasing, therefore step (i) applies. 

• Now we look at p>= 576, and see that in it, both L and R are monotone 
decreasing. Therefore, step (iii) applies, and we obtain the string 756. 

• Concatenating this with the unchanged part of the original permutation, we 
get that ^=21348756. 

□ 


Example 8.6 

• Lct j)=2 15439876, then L=21543, and A—876. So R is monotone decreasing, 
but L is not. So step (ii) applies. 

• Now/>=21543, so L=21 and i?=43. Both L and R are monotone decreasing, 
so step (iii) applies. 
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• In step (iii), we move 2 beyond 5, and rearrange L and R in increasing order. 
This gives the string 15234, which, concatenated with the unchanged part 
9876, gives ^152349876. 


□ 


THEOREM 8.7 

For any 231-avoiding n-permutation p, t!w algorithm described above outputs the unique n- 
permutation q tlmt covers p in H n . 


PROOF Note that being 231-avoiding is equivalent to the property that all 
entries of L are smaller than all entries of R, and that this holds recursively for L 
and R, and their recursively defined sub words. Steps (i) and (ii) of the algorithm 
will preserve this property as they operate within L and within R. Step (iii) moves 
the largest entry of L to the leftmost position of R, meaning that R now starts with 
its smallest entry. This again assures that the “entries of L are smaller than 
entries of R" property is preserved, therefore q is indeed 231-avoiding. 

Now we show that q indeed covers p. We are going to prove this by induction on 
n. The initial cases of n= 1 and w=2 are obvious. Now assume we know that the 
statement is true for all positive integers less than n. 

\{p=LnR is such that at least one of L and R is not monotone decreasing, then it 
is straightforward to see by the induction hypothesis that q covers p. If L and R 
are both monotone decreasing, then the only way to find permutations that are 
larger than p in H n is by moving n closer to the front. If we want to find a 
permutation that covers p, then we must move n up by one position. As our 
permutations are 231-avoiding, this means that the entry that gets beyond n in 
this move must be the largest entry of L. As L is monotone decreasing, its largest 
entry is its leftmost one, just as prescribed in Step (iii). Finally, to get the smallest 
possible permutation with the new osition of n, we must order L and A increasingly. 
This completes the proof. I 


Note that by the obvious bijections (reverse, complement), versions of the above 
algorithm will generate all 132-avoiding, 213-avoiding, or 312-avoiding 
permutations. It is less obvious how to generate all 321-avoiding or 123-avoiding 
permutations efficiently. 
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8.2 Stack Sorting Permutations 

The task of sorting, that is, arranging n distinct elements in increasing order efficiently, 
is a central problem of computer science. There are various sorting algorithms, 
like merge sort, or !uap sort, that can handle this task in 0(n log n) steps. It is also 
known that in the worst case, we indeed need this many steps. The interested 
reader is invited to consult a book on the Theory of Algorithms, such as [136], to 
learn these results. 

In this chapter we discuss some combinatorial sorting algorithms. In these 
algorithms, our goal is still to arrange n distinct objects (for the sake of shortness, 
the elements of [«]) in increasing order, but our hands will be tied by certain 
rules. 

Let p=pip 2 '"pn be a permutation, and assume we want to rearrange the entries 
of n so that we get the identity permutation 123--W. This does not seem to be a 
mountainous task. However, it will become more difficult if we are told that all 
we can use in our sorting efforts is a vertical stark that can hold entries in increasing 
order only (largest one at the bottom). We are even told how we have to use the 
stack (though this will rather help us than hold us back, as we will explain after 
Example 8.9). 

We note that there are alternative combinatorial sorting tools that lead to 
interesting problems; for instance, there are linear sorting devices that can 
receive or release entries on both ends. We will discuss some of these in Section 
8.3. In the present section, we will devote all our our attention to a certain 
greedy approach, which was first studied in detail by Julian West [196], though 
Tarjan did some work on the subject before that. Chronologically, this is not 
the earliest approach, but this is the one that has received most attention. Since 
this approach seems to be the prevailing one, we will often refer to it simply as 
“stack sorting,” instead of the longer, and more precise term “greedy stack 
sorting.” We will mention some other approaches in later sections, and in the 
exercises. 

In order to stack sort p=pipr"p« in the greedy way, we first pi, and put it in the 
stack. Second, we must take/^. \[p 2 <.pi, then it is allowed for p 2 to go in the stack 
on top oi pi, so we must put it there. It pP>pi, however, then first we take pi out 
of the stack, and put it to the first position of the output permutation, and put /; 2 
into the stack. We continue this way: at step i, we compare P, with the element 
currently on the top of the stack. If P,<r, then p, goes on the top of the stack, if 
not, then r goes to the leftmost empty position of the output permutation, and pi 
gets compared to the new element that is currently on the top of the stack. The 
algorithm ends when all n entries passed through the stack and are in the output 
permutation s(p). 

We are now ready to announce the most important definition of this section. 
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output 

stack 

input 



3142 


3 

142 


1 

3 

42 

1 

3 

42 

13 

4 

2 

13 

2 

4 


132 

4 


1324 




FIGURE 8.1 

Stack sorting^=3142. 


DEFINITION 8.8 If the output permutation s(p) defined by the above algorithm is the 
identity permutation 123--W, tlmi we say that p is stack sortable. 

Example 8.9 

Figure 8.1 shows the stages of stack-sorting the permutation 3142. We conclude 
that s(p)= 1324, therefore 3142 is not stack-sortable. D 

If you think that our sorting algorithm is too arbitrary, in that it requires the 
entries in the stack to be in increasing order, or that it tells us when an entry 
should enter or leave the stack, consider the following. If at any point of time, 
the entries in the stack were not in increasing order, then we could surely not 
obtain the identity permutation at the end since no entry x currently below an 
entry y in the stack could pass y in the output. So the monotone increasing 
requirement for the stack is not a real restriction. Now that we know this, let us 
look at the other requirement. That requirement says that if pfjr, where r is 
currently on the top of the stack and p, is the next entry of the input, then p, goes 
on the top of the stack, if not, then r goes to the output. We claim that this is not 


2004 by Chapman & Hall/CRC 













Get Them All. Algorithms and Permutations 


289 


a real restriction either. Indeed, if p t is the larger one, then it cannot go on top of 
r as that would destroy the increasing property of the stack. If p, is the smaller 
one, then r cannot go to the output before pi does as that would create an 
inversion in the output. This shows that if a permutation is sortable by a stack, 
our algorithm will output the identity permutation, and the mentioned criteria 
are not limiting our possibilities. 

What permutations are stack sortable? This is one of the very few easy 
questions of this area. 


propositions. 10 

A permutation is staek-sortable if and only if it is 231-avoiding. 


PROOF Let p be stack-sortable, and assume entries b, c, and a of p form a 231- 
pattern in this order. Then b would enter the stack at some point, but would have 
to leave it before the larger entry c could enter it. As a could only enter the stack 
after c, it is clear that a would arrive to the output after b did, implying that s(p) 
would contain the inversion ba, which is a contradiction. 

Conversely, assume that p is not stack sortable. Then the image s(p) contains 
an inversion yx. That means y>x, so y must have entered the stack before x did, 
and must have left the stack before x even arrived there. What forced y to leave 
the stack before x arrived? By our algorithm, this could only be an entry z that is 
larger than y and is located between y and x in p. However, in that case yzx was a 
231-pattern in p. I 

Note that we have in fact proved the following statement that will be useful in 
later applications. 


COROLLARY 8.11 

The entries u<3v ofp will appear in s(p) in decreasing order if and only if there is a 231- 
pattem in p whose leftmost element is v and whose rightmost element is u. 


8.2.1 2-Stack Sortable Permutations 

We have seen that only C„<4” permutations of length n are stack sortable out of 
n\ total permutations, meaning that greedy stack sorting in itself is not very 
efficient. What we can do , however, is send our output s(p) through the stack 
again. 


DEFINITION 8.12 The permutation p is called 2-stack sortable if we have 
s(s(p))=?(p)=\T3—n. Similarly, p is called t-stack sortable f we have s‘(p)=l23---n. 
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It is not hard to see that no n-permutation will require more than n -1 
applications of s to be sorted. In other words, all w-permutations are (n-l)-sortable. 
See Exercise 5 for a proof of this fact. 

Encouraged by the simple result of Proposition 8.10, one might try to 
characterize 2-stack sortable permutations by classic pattern avoidance. These 
efforts, however, are bound to fail. Indeed, permutation classes that are defined 
by pattern avoidance are always closed, that is, if p is in such a class, then so 
are all the substrings of p. This is not true for 2-stack sortable permutations, 
however. 


Example 8.13 

The permutation ^=35241 is 2-stack sortable. Indeed, P(p)=s( 32145) = 12345. 
On the other hand, its substring ^>=3241 is not 2-stack sortable, as 
Pfe)=j(2314)=2134. □ 


For 2-stack sortable permutations, some characterization is nevertheless possible. 
This is the content of the next lemma. 


LEMMA 8.14 

[196] A permutation is 2-stack sortable if and only if it does not contain a 2341-pattern, and 
it does not contain a 3241-pattem, except possibly as part of a 35241-pattern. 


PROOF Assume first that p is 2-stack sortable. Then by Proposition 8.10 s(p) 
must be 231-avoiding. However, if bcda were a 2341-pattern in p, then bca 
would be a 231-pattern in s(p). Similarly, if cbda were a 3241-pattern in p that 
is not part of a 35241-pattern, then s(p) would contain a 231-pattern. This 
pattern could be bca (if there is no 231-pattern starting with c and ending with 
b) or cfa (if f is the largest entry between c and b in p so that cfb is a 231- 
pattern) . 

Assume now that/> is not 2-stack sortable. Then s(p) is not stack sortable, 
therefore it must contain a 231-pattern. Let bca be such a pattern. Then it 
follows from Corollary 8.11 that there is a 231-pattern beta and a 231-pattern 
cea in p. If b is on the left of c in p, then we see that bcea is a 2341-pattern in p. 
If b is on the right of c'mp, then again by Corollary 8.11, there cannot be any 
entry between c and bmp that is larger than c. However, that implies that cbea 
is a 3241-pattern in p that is not part of a 35241-pattern, completing our 
proof. | 

The problem of finding an exact formula for the number W 2 (n) of 2-stack sortable 
w-permutations is a difficult and fascinating one. 
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THEOREM 8.15 

For all n, we have 


W 2 {n ) = 


2(3n)! 

(n + l)!(2n + l)! 


The above formula was first conjectured byJ.West in his Ph.D. thesis [196], who 
pointed out that these numbers also enumerate nonseparable rooted planar maps 
on n+l edges. The first proof was provided five years later by D.Zeilberger 
[207], who used a computer to find the solution to a degree-9 functional equation. 
Two other proofs [77], [118] have been found later. Both construct fairly complicated 
bijections between the set of 2-stack sortable w-permutations and the 
aforemendoned planar maps. The latter have been enumerated by Tutte [191] in 
1963. Finally, a bijecdon between two-stack sortable permutations, and a certain 
class of labeled trees, the so-called /?(1, 0)-trees was discovered [64], but a simple 
proof is yet to be found. Even a simple proof for the much weaker claim that 
W 2 (h) < ( 3 ”) could be very useful as it could provide a first step towards finding 
upper bounds for the numbers of /-stack sortable permutations, for t> 2. 

We close our introduction to 2-stack sortable permutations with a fascinating 
fact that has not been satisfactorily explained yet. It is a classic result of Kreweras 
that does not look related at first sight. 


THEOREM 8.16 

The number of latticepaths starting at (0, 0), endingat(i, 0), and using3n+2isteps, each 
of which is equal to either (l, 1), or ( 0, -1), or (-1, 0), that never leave thefirst quadrant is 


4 n (2i + 1) /2A /3 n + 2 i\ 

(n + i + 1) (2 n + 2i + 1) \ i ) \ n ) 

See [45] for more information about this result, including an explanation of 
why none of the existing several proofs show the “real reason” for which the 
formula holds. If we set f=0, then we get that the number of such lattice paths 
that end at (0, 0) in 3 n steps is precisely 2 2n l -W 2 (n). If someone could find a 
direct proof of this fact, that could lead to a simple proof for the formula for 
the numbers W 2 (n). 


8.2.2 /-Stack Sortable Permutations 

Let W,(n) be the number of /-stack sortable w-permutations. There are almost 
no results concerning the exact enumeration of these permutations. See 
Exercise 9 for the best known upper bound, that is far from what seems to 
be the truth. 

( 2 n \ 2(' 3 "1 

Encouraged by the exact formulae \Y 1 ( t ) = W and pp 2 [f — _ +1 ^y- +1 y 
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we might think that in the general case, we could hope for a formula like 



wher cp(n) is a polynomial with rational coefficients. Numerical evidence, however, 
does not seem to support this conjecture for any polynomials of reasonably 
small degree. Nevertheless, it seems still possible that W,(n)^ t+ ^ n ). 

The following simple observation at least narrows the field of permutations 
that have a chance to be /-stack sortable. 

PROPOSITION 8.17 

Ifp is t-stack sortable, thenp avoids the pattern 23-"(/+2)l. 

PROOF If p contains such a pattern, then the entries forming a copy of such a 
pattern will form a 23-"/l(/+l)-patternin s(p), a 23"-(/-l)l/(/-tT)-pattern in &(p), 
and so on. Finally, they will form a 23145-"(/+l)-patternin T(p), implying that 
C (p) contains a 231-pattern, and is not stack sortable. So its image, d(p), is not 
the identity permutation. | 

Finally, we would like to invite the reader to look at Exercise 11. In this exercise, 
we show how the /-stack sorting operation can be simulated using / distinct stacks 
in series, using a certain right-greedy algorithm. 

8.2.2.1 Symmetry 

Let W,(n, k) be the number of /-stack sortable permutations with k descents. We 
propose to fix n and t, and investigate the sequence W,(n, For instance, if 

n=5, and 1—5, then we find the sequence 1, 25, 62, 25, 1, while for n= 4 and t= 1, 
we get the sequence 1, 6, 6, l.A look at further numerical evidence suggests 
several interesting properties of these sequences. The simplest one is symmetry, 
that is, it seems that W,(n, k)=W,{n, n-l-k), or in other words, there seem to be as 
many /-stack sortable w-permutations with k descents as with k ascents. In fact, for 
t=n-l, the statement is obvious, considering Exercise 5, while for /= 1 and t= 2 we 
can verify that the statement is true using the known explicit formulae. See 
Problem Plus 1 of Chapter 4 for the relatively simple formula for t= 1, and see 
Problem Plus 1 of this chapter for the more complex formula for /=2. These are 
not sporadic special cases. 


THEOREM 8.18 

For allfixed n and t, we have W, (n, k)=W, (n, n-l-k). 
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FIGURE 8.2 

The decreasing binary tree of ^=3762514. 


PROOF There seems to be no trivial reason for this symmetry. Indeed, our 
usual symmetries (reverse, complement) that turn ascents into descents do not 
preserve the /-stack sortable property, even when 1=1. In order prove our theorem, 
we need to find a more subtle symmetry that turns ascents into descents, and 
preserves the /-stack sortable property for any t. 

Before we can define this symmetry, we need to extend the notion of binary 
plane trees that we discussed in Exercise 27 of Chapter 4 in relation to 231 avoiding 
(that is, stack sortable) permutations. These were unlabeled plane trees. Our 
extended notion will associate a labeled plane tree to each w-permutation. Some 
of our hardest-working readers have already seen the following definition, that 
was made in Exercise 31 of Chapter 1. We repeat it for easy reference. 

DEFINITION 8.19 The decreasing binary tree T(p) of the n-permutationp is the labeled 
plane tree whose root is the vertex associated to the entry n, and whose left subtree corresponds to 
t!w subword ofp preceding n ('instructed by this same rule), and whose right subtree corresponds 
to t!w subword ofp following n (constructed by this same rule). 

Example 8.20 

The decreasing binary tree of the permutation 3762514 is shown in Figure 
8.2. Here we have s(p)= 3214567. [j 
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It is easy to prove (Exercise 31 of Chapter 1) that decreasing binary trees on 
n nodes are in bijection with w-permutations. 

There are two reasons for which decreasing binary trees are fitting to the task 
at hand. First, just as their remote cousins, the binary plane trees, they encode 
the number of descents. Indeed, the reader is invited to prove the simple fact that 
the number of descents of p is equal to the number of right edges (that is, edges 
that go from northwest to southeast) of T(p). Second, the stack sorted image s(p) 
can easily be read of T(p), thanks to the following proposition. 

PROPOSITION 8.21 

Letp be an n-permutation, and letp=LnR, where L (resp. R) denotes the (possibly empty) 
string of entries on the left (resp., right) oftthe entry n. Then we have 

s(p)=s(L)s(R)n. (8.1) 


PROOF By the definition of the stack-sorting algorithm, the entry n can only 
enter the stack when it is empty, that is, when all entries on the left of n passed 
through the stack. This shows that s(p) will start with the string s(L). Once L 
passed through the stack, n will enter. However, as n is larger than any other 
entry, n will not be forced out from the stack by any other entry. So n will stay at 
the bottom of the stack until all other entries have passed through it. This shows 
why s(p) will continue with the string s(R), and then end in the entry n. 

This important “left-right-w” property could be used to define the stack sorting 
operation. Here is what it means in the context of decreasing binary trees. 


COROT.T.ART 8.22 

Given T(p), we can read off s(p) from T(p) by reading the nodes oftT(p) postorder. That is, 
first we read the left subtree oftthe root, then the right subtree of the root, and then the root itself. 
Each subtree is read by this same rule. 


PROOF Immediate by induction on n, using Proposition 8.21. See Figure 8.2 
for an example. | 

Now we are ready to define the bijection that will prove Theorem 8.18. We will 
define this bijectionyin terms of decreasing binary trees as this approach shows 
the “real reason” for the nice symmetries we are proving. As there is a one-to- 
one correspondence between these trees and w-permutations, ft can easily be 
interpreted in terms of permutations as well. 
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FIGURE 8.3 

The tree f(T(p)) for p=3762514. 


DEFINITION 8.23 Let T(n, k) be the set of decreasing binary trees onn vertices that have 
k right edges, and letTijp) £ T(n, k). For each vertex v ofT(p) do asfollows. 

(a) Ifv has zero or two children, leave the subtrees of v unchanged. 

(b) Ifv has a left subtree only, then turn that subtree into a right subtree. 

(c) Ifv has a right subtree only, tlwn turn that subtree into a left subtree. 

Letf(T(p<)) be the tree we obtain from T(p) this way. 

Example 8.24 

Let ^=3762514, then T(p) is the tree shown in Figure 8.2. 

To construct f(T(p)), note that the nodes corresponding to 3, 7, 2, 5, and 1 
have either zero or two children, so they are left unchanged. The node 6 has a 
right subtree only, so that subtree is turned into a left subtree, while the node 4 
has a left subtree only, therefore that subtree is turned into a right subtree. We 
end up with the tree f(T(p)) shown in Figure 8.3. 

We then read off that the corresponding permutation is f(p)= 3725416. 0 
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It is then clear that £ T(n, n — 1 — k), so/"has the desired effect on 

the number of descents of the permutation p. 

You could say “Fine. So you have proved in an ever so sophisticated way 
that there are as many n permutations with k descents as there are with k 
ascents. I can do it by just reversing all permutations.” At this point, you would 
be right in saying so. However, our bijection /preserves t!w t-stack sortableproperty 
for any t, implying that the restriction of f to the set of i-stack sortable n- 
permutations is just what we need to prove our theorem. This is the content of 
the next lemma. 

For a permutation p, we will write ftp) for the permutation whose decreasing 
binary tree is f(T(p)). In other words, f(F(p))=F(f(p)). 

LEMMA 8.25 

For any fixed n and t, the n-permutation p is t-stack sortable if and only if f(p) is t-stack 
sortable. 

PROOF We claim that 

s(P)=s(f(p))> ( 8 - 2 ) 


that is, the stack sorted images of p an&f(p) are (i-l)-stack sortable at the same 
time, which clearly implies the statement of the Lemma. 

Intuitively speaking, what (8.2) says is that pushing some lonely left edges to 
the right or vice versa does not change the postorder reading of F(p). Let us 
make this argument more precise. 

We prove (8.2) by induction on n. For n=i and n—2, the formula obviously 
holds. Now assume we know the statement for all integers less than n. 

If the root of T(p) has two children, then p=LnR, and the postorder 
reading of F(p) is just the concatenation of the postorder reading of F(L), 
the postorder reading of F(R), and n. By our induction hypothesis, the 
postorder reading of 'F(L) is the same as that of T(f(L)), and the postorder 
reading of F(R) is the same as that of T(f(R)). Therefore, as the root of 
F(f(p)) has two children, and they are roots of the trees F(f(L)) and F(f(R)), 
the postorder reading of T(p) and that of T(f(p)) are identical as they are 
concatenations of identical strings. 

If the root of F(p) has a left child only, then p=Ln, and the postorder reading 
of T(p) is that of F(L), with n added to the end. In this case, the root of T(f(p)) has 
only one child, and that is a right child. This child is the root of a subtree isomorphic 
to F(L). This is no problem, however. As the root has only one child, the postorder 
reading of T(f(p)) is just the postorder reading of this one subtree, that is, 7 (L), 
with n added to end. This proves our claim. If the root of 'T(p) has a right child 
only, the argument is the same with “left” and “right” interchanged. 
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Therefore, the restriction of f to the set of /-stack sortable w-permutations 
maps to the set of /-stack sortable w-permutations. We have seen that f maps 
permutations with k descents to permutations with k ascents, so our theorem is 
proved. 


8.2.3 Unimodality 

A good feature of the proof of Theorem 8.18 was that we could prove a relatively 
strong statement about /-stack sortable permutations, without even knowing what 
they are, in the sense of an explicit characterization. In this subsection, we are 
continuing this line of work. 

THEOREM 8.26 

For allfixed n and t, the sequence W,(n, k) 0sl ^ n _i is unimodal. 

Before we prove the theorem, we point out that while the special cases /=1 and 
t= 2 can again be verified by the aforementioned explicit formulae, the special 
case t=n- 1 is not nearly as obvious as it was when we proved symmetry. 

PROOF As we have seen, in Theorem 8.18, that the sequence at hand is 
symmetric, it suffices to prove that W t (n, k) < W t (n, k+ 1) for k < [(n-3)/2j. 

We resort to decreasing binary trees again, and the outline of our proof will 
also be somewhat similar to that of Theorem 8.18. We will first define an injection 
z:T(n, k)—>T(n, k+1). Then we will show that z preserves the /-stack sortable 
property, completing our proof. 

Let The any decreasing binary tree on n nodes. We define a total order of the 
nodes of T as follows. Let us say that a node v of 7'is on levelj of T if the distance 
of v from the root of 7’is j. Then our total order consists of listing the nodes on 
the highest level of 7 going from left to right, then the nodes on the second 
highest level left to right, and so on, ending with the root of T. 

Let 7’ be the subgraph of 7 induced by the smallest i vertices in this total 
order. Then 7” is either a tree or a forest with at least two components. If 7’ is a 
tree, thcn/fT’j is a tree as described in Definition 8.23. If 7’is a forest, then we 
define f(T) as the plane forest whose /all component is the image of the /ah 
component of 7,. 

Now let k < [(n — 3) /2J, and let T € T(n, k )• We define z(7) as follows. Take 
the sequence 7j, T 2 , —,T n . Denote by 1(1) (resp. r(T)) the number of left (resp. right) 
edges of the forest 1. Find the smallest index i so that l(T)-r(T)=\. We will now 
explain why such an index always exists. If 7) is a left edge, then /(7j)-r(7j)=l-0=l, 
and we are done. Otherwise, at the beginning we have /(7j-r( 7j)<l, wliilc at the end 
we have l(T n )-r(T n ) > ((n - 1) - [(n - 3)/2j) - |_(n - 3)/2j > 1, because 
of the restriction on k. So at the beginning, l(T)-r(T) is too small, while at the end, it 
is too large. On the other hand, it is obvious that as i changes from 2 to n, at no 
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FIGURE 8.4 

A decreasing binary tree and its image under z. 


step could l(Tj)-r(Tj) “skip” a value as it could only change by 1. Therefore, for 
continuity reasons, it has to be equal to 1 at some point, and we set i to be the 
smallest index for which this happens. 

Now apply f to the forest 7’, and leave the rest of 7 unchanged. Let z(I) be 
the obtained tree. Before we prove that z is an injection, let us look at an example. 

Example 8.27 

Let T be the decreasing binary tree shown on the left of Figure 8.4. The small 
numbers denote the ranks of the vertices in the total order defined above, and 
the large numbers are the entries of the permutation associated with p. 

Then the smallest index i for which l(7’)-r('I')=[ is j=2, so we have to apply/’ 
to the tree 7/ the tree that is encapsulated in Figure 8.4. The image z(T) is then 
shown on the right-hand side of Figure 8.4. 0 

Let us return to the proof of the injectivity of z. 

LEMMA 8.28 

For allk < |_(n — 3) /2 J, the function z described above is an injection from F(n, k) into F(n, 
k+l). 
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PROOF It is clear that z maps into T{n, k+1) as z consists of the application of f 
to a subgraph 7’in which left edges outnumber right edges by one. We know that 
f turns left edges into right edges and vice versa, so z indeed increases the number 
of right edges by 1. 

To see that z is a injection, note that z(T) is the smallest subforest of z(T) that 
consists of the first few nodes of z(T) in the total order of all nodes and in which 
right edges outnumber left edges by one. 

Now let JJ £ T(n,k+ 1), If U does not have a subforest that consists of 
the first few nodes of U in the total order of all nodes and in which the right 
edges outnumber the left edges by one, then by the previous paragraph, U 
has no preimage under z. Otherwise, the unique preimage z- 1 (U) can be found 
by finding the smallest such subforest, applying f to it, and leaving the rest of 
U unchanged. | 

So the injection z provides yet another proof of the fact that the Eulerian 
numbers form a unimodal sequence. In order to show the relevance of z to /- 
stack sortable permutations, we still have to show that z preserves the /-stack 
sortable property. This will be done again in a way that is quite similar to the 
proof of Theorem 8.18. Denote by s(T) the postorder reading of the decreasing 
binary tree T. 

LEMMA 8.29 

For any decreasing binary trees Tfor which z(T) is defined, we have s(T)= s(z(T)). 

PROOF The only place T and z (I) differ is in their first i vertices in the total 
order. However, even those parts agree in their postorder readings as they are 
images of each other under the postorder-preserving bijection f. | 

The proof of Theorem 8.26 is now immediate. Restrict z to the set of decreasing 
binary trees corresponding to /-stack sortable w-permutations with k descents, 
where fc < [(n — 3)/2j- By the two preceding lemmas, this restriction maps 

injectively into the set of trees corresponding to /-stack sortable n-permutations 
with k+[ descents, proving our claim. | 


8.2.3.1 Log-concavity 

Having seen that the sequence W,(n, k Josfen-i is unimodal for any fixed n and k, it 
is natural to ask whether these sequences are log-concave as well. Surprisingly, a 
general answer for this question is not known. It is rather rare in enumerative 
combinatorics to have a sequence that arises naturally from some enumeration 
problem, is known to be symmetric and unimodal, and is not known (but widely 
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thought) to be log-concave. So our sequences W,(n, £) 0 ==fc=«-i are examples of a 
rare phenomenon. 

There are some sporadic results, though. For ^=1 and 1=2, the result is 
immediate from the explicit formulae for our numbers. For t=n- 1, the statement 
is equivalent to the log-concavity of the Eulerian numbers. See Exercise 13 for 
the special case of t—n-2. 


8.2.3.2 Real Zeroes 


Numerical evidence seems to suggest the even stronger statement that for any 
fixed n and t, the polynomial 

n—1 

W n , t (x) = s jrWt(n,k)x k+1 

k =0 

has real zeroes only. This observation has not been proved to hold in all cases 
yet. The only simple special case is when t=n- 1, because in that case the statement 
simply claims that the Eulerian polynomials have real zeroes only. 

The special cases of t= 1 and 1=2 are not obvious, even though we have exact 
formulae for the numbers IV, (n, k). Both of these special cases have recently been 
given relatively simple proofs by Peter Branden, though the case of ^=1 follows, 
using heavier machinery, from a result of Francesco Brenti. 


8.3 Variations Of Stack Sorting 


As we mentioned in the previous section, the stack sorting operation s we have 
been discussing is not the earliest one in this line of research. The first 
comprehensive study of stack-like sorting devices is due to D.E.Knuth, [136], 
Section 2.2.1. His approach to stacks was slightly different than ours. He considered 
the following problem. 

Let us start with the permutation 123 - - - /i, instead of ending in it, and let us try 
to obtain as many permutations from it as possible, using the stack. This time we 
do not require that the entries in the stack be in increasing order as that would 
prevent us from obtaining any permutation different from the identity. Therefore, 
at any point of time, we have two choices. Either we put the entry on the top of 
the stack to the output, or we put the next entry of the input into the stack. 

If an n-permutation p can be obtained from 12---W this way, then we call it 
obtainable. 

Example 8.30 

Let p=231. Then p is obtainable as we can proceed as shown in Figure 8.5. 
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FIGURE 8.5 

Obtaining p =23 


□ 

It is then proved in [136] by a simple direct argument that p is obtainable if 
and only if it is 312-avoiding. We suggest that the reader try to find such a proof. 
On the other hand, as we have studied the stack sorting operation s so extensively, 
we will reduce this problem to the characterization of stacksortable permutations. 

PROPOSITION 8.31 

The n-permutation p is obtainable if and only ifp l is stark sortable. 

PROOF Let p 1 be stack sortable. This means that a certain sequence of 
movements of the entries of fi ] through the stack (namely, the sequence defined 
by s) turns p 1 into id=12--n. Carrying out the same sequence of movements on 
p A -p will turn p l -p into idp=p. In other words, we could say that we relabeled the 
entries of p l so that it became id, before sending it through the stack. So p is 
obtainable. 

Now assume that p is obtainable. That means that a certain sequence of 
movements through the stack turns id into p. Then that same sequence of 
movements turns p 1 into id. Note that this implies that at any given point of time, 
the entries of the stack were in increasing order with the smallest one on the top. 
Indeed, if i<f, and j had been above zin the stack, then j would be before zin the 
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output, so the output could not be id. So the sequence of movements that turns 
p l into id is precisely the stack sorting operation s, so s(p l )=id, and our proof is 
complete. 

As a permutation is 312-avoiding if and only if its inverse is 231-avoiding, we see 
again that p is obtainable if and only if it is 312-avoiding. 

In both approaches to stacks we have seen, the crucial property of the stack 
was that the entries entered the stack at its top, and left it at its top. This was the property 
that enabled us to change the order of some entries by passing them through the 
stack. There are other sorting devices with slightly different restrictions. One of 
them is a queue, which differs from the stack because the entries enter at the top 
of the queue, and leave at the bottom of the queue. So the entries will leave the 
queue in the order they were received (as is the case, hopefully, with a dentist’s 
waiting room). Therefore, the queue is not an exciting sorting device as the only 
permutation it can sort is the identity permutation itself. 

Knuth has considered several generalizations of stacks and queues in his ground¬ 
breaking work [136]. In all of these, his approach was to look for the set of 
permutations that could be obtained by some kind of sorting device. The most 
powerful sorting device considered in [136] is the double-ended queue, or deque. This 
is a sorting device that has the capabilities of both a stack and a queue, and even 
a little bit more. That is, entries can enter at both ends of the deque, and they can 
leave at both ends of the deque. Again, there is no restriction on the order of the 
entries that are in the deque. Let us call an w-permutationj!) general deque obtainable 
if it can be obtained from 12---W using a deque. The adjective general is inserted to 
avoid confusion as some authors use the word deque for a more restricted 
sorting device. That is, p is deque-obtainable if it can be obtained from 12 —n by 
a movement sequence, in each step of which we either place the next entry of the 
input to either end of the deque, or place the entry currently at the top or 
bottom of the deque into the output. 

Example 8.32 

The permutation 3412 is deque obtainable. See Figure 8.6 for a movement 
sequence leading to this permutation. Q 

In order to have an equivalent of Proposition 8.31, we need to define when we 
call a permutation general deque-sortable, or gd-sortable, and what the gd-sorting algorithm 
is. As the general deque is more flexible a tool than the stack, we can allow more 
flexibility in the deque sorting algorithm than in the stack sorting algorithm. That 
is, our sorting algorithm will not be deterministic, and the elements in the stack 
will not have to be in increasing order. They will have to form a unimodal sequence, 
though. The goal of the deque sorting algorithm is again to turn a permutation^ 
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FIGURE 8.6 

Obtaining^=3412 by a deque. 


into the permutation id=12--n. We define ageneric step of the gd-sorting algorithm 
as follows. 


Nondeterministic General Deque Sorting. 

START At any given step, if the deque is empty, we just place the next entry of 
the input in the deque. If there is only one entry a in the deque, then we can place 
the next entry of the input on either side a in the deque. 

Otherwise, we compare the next element of the input, pi, to the entries at the top 
and bottom of the deque, x, and y. 

(a) If^,-is smaller than exacdy one of xory, then p, is placed to the end of the 
deque where the element larger than p, is. 

(b) If pi<x and p,<y, then we can place p, to either end of the deque, and go 
back to START.. 

(c) If pl>x and ppy, then we move the smaller of x andjy to the output, and go 
back to START. 

If there are no more entries in the input, then we empty the deque, always 
moving the smaller of the two entries at the ends of the deque to the output 

END. 

Note that this algorithm assures that the entries in the deque indeed always 
form a unimodal sequence. 

DEFINITION 8.33 An nfermutation p is called general deque sortable if it can be 
turned into id using the above nondeterministic algorithm. 
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The only nondeterministic step of this algorithm is (b). Definition 8.33 means 
that if there is any series of choices at step (b) that will result in the identity 
permutation at the end, then p is called gd-sortable. See Exercise 26 for a 
deterministic version of this same algorithm. 

PROPOSITION 8.34 

A permutation p is general deque obtainable if and only if p l is general deque sortable. 
PROOF Analogous to the proof of Proposition 8.31. 

It seems that general deque sorting is a significantly more complex procedure 
than 2-stack sorting. We will see that this intuition holds true in an important 
aspect. In another aspect, however, it fails, as shown by the following result that 
was proved in a slightly different form by Pratt in [157]. 

LEMMA 8.35 

The class of general deque sortable permutations is closed. 

In other words, if q 'is general deque sortable, and p—q in the pattern containment 
order, then p is also general deque sortable. Recall that this was not true for 2- 
stack sortable permutations. 

PROOF As taking inverses preserves the pattern containment relation, it is 
sufficient to prove that the set of general deque obtainable permutations is closed. 
If q is gd-obtainable with a sequence S of movements, and p—q, then p is gd- 
obtainable by the subsequence of S that belongs to a given copy of p contained in 
q. Indeed, the presence of the remaining entries of q is not necessary for the 
entries of that jb-copy (it does not help them in any way) in order to move 
through the deque. | 

In other words, gd-sortable permutations form an ideal If in the poset P of all 
finite permutations ordered by pattern containment, and permutations that are 
not gd-sortable form a dual ideal in I gi P. The minimal elements of I gd (or any 
ideal, for that matter), form an antichain A. If A were finite, then we could 
characterize I gd as the set of permutations avoiding all of a finite number of 
patterns, namely the patterns in A. This turns out to be not the case, however. 

THEOREM 8.36 

The antichain of all minimal elements ofI gd is infinite. 


2004 by Chapman & Hall/CRC 


Get Them All. Algorithms and Permutations 
PROOF See Problem Plus 6. | 


305 


No simpler characterization is known. So even if the gd-sortable permutations 
form a closed class, and the 2-stack sortable permutations do not, the latter are 
easier to characterize, and even, enumerate. It is probably not surprising after 
this theorem that the enumeration of gd-sortable permutations of length n is not 
resolved yet. However, if we take away from the very strong abilities of the 
deque, we can succeed in the analogous task. 

We will say that a deque is input-restricted if entries can only enter at the top of 
the deque, but can leave at both ends. Similarly, we say that a deque is output- 
restricted if entries can only leave at the bottom of the deque, but can enter at both 
ends. 

The reader probably senses some symmetry between the two sorting devices, 
and as we will see, the reader is right. So for a while we will restrict our attention 
to input restricted deques. In line with our previous definitions, we will say that 
an w-permutation p is ir-obtainable if we can obtain it by sending the identity 
permutation 12 —n through an input-restricted deque. For the definition of ir- 
sortability, we first need to define the sorting algorithm. This algorithm is a somewhat 
simplified version of the gd-sorting algorithm, which was explained immediately 
preceding Definition 8.33. The simplification is that this is a deterministic algorithm. 


Determinstic IR Deque Sorting. 

START At any given step, if the deque is empty, we just place the next entry of 
the input in the deque. 

Otherwise, we compare the next element of the input, p„ to the entries at the 
top and bottom of the deque, x, and y. 

(a) lipi<x, then put p, on top of x in the deque, and go back to START. 

(b) HpP>x, and the deque is not monotone decreasing starting at the top, then 
move the smaller of x andjy in the output, and go back to START. (If there 
is exactly one entry in the deque, that is considered a monotone decreasing 
deque.) 

(c) If Pj>x, and the deque is monotone decreasing starting at the top, then 
put on top of x in the deque, and go back to START. 

If there are no more entries in the input, then we empty the deque, moving 
always the smaller of the two entries at the ends of the deque to the output. The 
output permutation is denoted ir(p). END. 

Note that this algorithm assures that the elements of the deque always form a 
unimodal sequence. Now we can call an w-permutation p ir-sortable if ir(p)=id. By 
now it should be obvious that p is ir-obtainable if and only if/; 1 is ir-sortable. 
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Knuth [136] has computed the number of ir-sortable n-permutations by a 
generating function argument. His work was completed before the pattern 
avoidance explosion, that is, the bonanza of enumerative results on permutations 
avoiding certain patterns. We will now show how to reduce the problem of 
enumerating ir-sortable permutations into a problem of enumerating pattern 
avoiding permutations. 

Let us first try to characterize the set of ir-sortable permutations. It is easy to 
check that all permutations of length three or less are ir-sortable. The problems 
start at length four, when we cannot ir-sort 4231, and we cannot ir-sort 3241. 
This is not an accident. 

LEMMA 8.37 

If p contains 4231 or 3241, then p is not ir-sortable. 

PROOF Assume p contains 4231, and let «<£<e<d form a copy of 4231 in p. In 
other words, these four entries follow in the order dbca in p. So d will enter the 
deque first. If p is to be ir-sortable, then b has to enter the deque before d can 
leave it, otherwise d precedes b in the output. Now b is above d, and therefore b 
has to leave the deque before c arrives, let alone before a arrives. This means 
that b will precede a in the output. | 

Similarly, assume p contains 3241, and let x<y<r<z form a copy of 3241 in 
p. So the order of these four entries in p is vyzx. Then v enters the deque first. 
Then, if p is to be ir-sortable, y enters before v can leave. Now y is above v, so 
it has to leave the deque before z arrives, meaning that y will precede x in the 
output. 

Having seen how difficult the problem of characterizing permutations that are 
sortable by some device can be, the reader might be bracing for some complicated 
counterpart of Lemma 8.37. If this is the case, then the reader will be happily 
surprised. 

THEOREM 8.38 

The permutation p is ir-sortable if and only if it avoids both 4231 and 3241. 

PROOF As Lemma 8.37 proves the “only if” part, we only have to prove the 
“if” part. 

We prove the contrapositive. Let p be such that ir(p)¥=id. Let a<b be two 
entries that form an inversion in ir(p), that is, b precedes a in ir(p). It is 
straightforward to see that in this case, b has to precede a in p, too. (Recall that 
the deque is unimodal.) What forced b to leave the deque before a arrived? The 
only step of the sorting algorithm when an entry has to leave the deque is (b). 
Therefore, b had to leave the deque at a point when b was the smaller of the two 
entries b and c at the two ends of the deque, the deque was not monotone 
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decreasing, and an entry x larger than both b and c was the next entry of the 
input. At that point, one of the following two situations had to occur. 

1. The entry b was at the top of the deque, and c was at the bottom. That 
means c precedes bmp, and so cbxa is a 3241-copy in p. 

2. The entry c was at the top of the deque, and b was at the bottom. As we 
know that the deque was not monotone decreasing, there had to be an 
entryy in the deque that was larger than c. Thcny precedes cmp, and the 
subsequence ycxa is either a 3241-pattem, or a 4231-pattem, depending 
on which one of x andy is larger. 


■ 


COROTT. ART 8.39 

Let a n be the number ofir-sortable n-permutations, with Oq= 1. Define r,=a n+x . Then we have 



n> 0 


PROOF This follows from the soludon of Exercise 9 of Chapter 4. So the ir- 
sortable permutations are enumerated by the large Schroder numbers. 

Note that this implies that r n = What can be said about 

output-restricted deques? These are deques that allow entries to leave at the top 
only, but allow entries to enter at either end. If we want to sort a permutation p 
by such a deque, it is obvious that we have to keep the deque increasing from top 
to bottom, which makes the definition of the rest of the sorting algorithm easy. 
The reader should try to solve Exercises 24 and 25 for the details. 


Exercises 


1. Take a permutation of length n. Assign a direction (Left or Right) to each 
entry, such as in 2 £ 3j j 4 i l f> L . Call an entry x winning if its neighbor in the 
direction given to x exists, isy, andy<x. So for instance, in 2 £ 3;?4 i l f> L , the 
entries 4 and 5 are winning. 

Now consider the following algorithm. Start with the permutation p x = 
\p2 L —n L . Finding the largest winning entry x, and interchange it withy. 
Finally, change the direction of all entries z if z>x. Call the obtained new 
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permutation^. Then apply this same procedure topi, and call the resulting 
permutation^, and so on. Prove that this procedure will generate all n\ 
permutations of length win the first w! steps. 

2. Which w-permutation will be generated last by the above algorithm? 

3. Find an algorithm to generate all w-permutadons with a fixed number k of 
inversions. 

4. Find an algorithm to generate all w-permutations with a fixed major index m. 

5. Prove that all w-permutations are (w-l)-stack sortable. 

6. Is there a permutation p so that t(jj)=2413657? 

7. Characterize and enumerate all w-permutations that are not (w-2)-stack 
sortable. 

8. Characterize and enumerate all w-permutations that are not (w-3)-stack 
sortable. 

9. Prove that W,(n)<(t+l) 2n . 

10. In the proof of Theorem 7.32 we have seen a bijection that (after taking 
reverses) maps 231-avoiding, so stack-s or table, w-permutations with k 
descents into such permutations with n-\-k descents. Was that bijection 
the same as the bijection f of Theorem 8.18 in the special case t=Y? 

11. Consider the following modification of the /-stack sorting operation. Instead 
of passing a permutation through a stack t times, we pass it through t 
stacks placed next to each other in series as follows. The first stack operates 
as the usual stack except that when an entry x leaves it, it does not go to 
the output right away. It goes to the next stack if x>j, where j is the entry 
on the top of the next stack. If j<x, then x cannot move until j does. 

The general step of this algorithms be as follows. Let *Sj, S 2 , —, S, be the t 
stacks, with a, being the entry on top of stack S,. If the next entry x of the 
input is smaller than a h we put x on top of .S). Otherwise, we find the 
smallest i so that ai can move to the next stack (that is, that or 5,+! 

is empty), and move a, on top of *S) + i. If we do not find such i, or if tSj, >S>, 
■■■, Sti and the input have all been emptied out, then we put the entry on 
the top of S, into the output. 

(a) Let t= 2. What is the image of 231 under this operation? 
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(b) Characterize all permutations p be a permutation whose image s,(p) 
under the operation defined above is the identity permutation. 

The above sorting algorithm is sometimes called right-greedy [12] as we 
always make the rightmost move possible. 

12. Prove that W,(n) cannot be larger than the number of Standard Young 
Tableaux of shape (i+1) x n in which row i does not contain two consecutive 
integers, for 2<z</. 

13. Prove that for any fixed n, the sequence W lt2 (n, ^ 0 sfc=«-i is log-concave. 

14. Prove that for any fixed t, and any even integer n, the number W t (n) is 
even. 

15. Define the Injection/’ of Definition 8.23 without using decreasing binary 
trees. 

16. A fallma permutation p=pip 2 ---p n is an index zso that ^,. 1 >P,>^, +1 . A rise is 
an index i so thzt ppi<ip<p M . Prove that the number of /-stack sortable n- 
permutations with k rises is equal to the number of /-stack sortable n- 
permutations with k falls. 

17. Let P m _, be the poset of all finite permutations that are not stack sortable, 
ordered by pattern containment. Does P m _, contain an infinite antichain? 

18. Call a permutation p sorted if there is a permutation q so that s(q)=p. At 
most how many descents can a sorted w-permutation have? 

19. Let p and ^be two w-permutations so that s(p)=pr and s(q)=qr for some r 
eS„, where pr means the product (dp and r in S n . Prove that the decreasing 
binary trees T(p) and T(q) differ only in their labels, not in their underlying 
trees. 

20. Is there a permutation pattern r so that if p contains r, then p cannot be 
sorted? 

21. For P € S(n) > let b p be the unique w-permutation so that pb p =s(p), where 
the left-hand side refers to the product in S„. Let 


B = {b p \p £ S n }. 

How many elements does B have? 

22. How many permutations b £ S n are there so that there exists exactly one 
p £ S n so that b=bp ? 
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23. For which w-permutations p is .f 1 (p) the largest (consists of the largest 
number of permutations) ? 

24. Define the deterministic sorting algorithm or that uses an output-restricted 
deque. 

25. (a) Characterize permutations that are or-sortable. 

(b) Deduce that the number of or-sortable w-permutations is equal to the 
number of ir-sortable w-permutations. 

26. Define a deterministic version of the general deque sorting algorithm that 
turns the w-permutation p into 12--W if and only if p is gd-sortable. 

27. Let us call an w-permutation p separable if 

(a) p= 1, or 

(b) p=LR, where L and R are both separable permutations (after 
relabeling), and the entries of L are either the smallest |L| elements of 
[n], or the largest |Z| elements of [n]. 

Find a characterization of separable permutations in terms of pattern 
avoidance. 

28. Pop-stark sorting is a weaker version of stack-sorting. The entries of the 
permutation to be sorted, p, enter the pop-stack one by one, just as in 
stack-sorting, as long as the entries in the pop-stack form an increasing 
sequence from top to bottom. However, when this is no longer possible, 
that is, when the next entry of/? is larger than the entry on the top of the 
pop-stack, then the entire pop-stack must go into the output. Call p pop- 
stack sortable if the image of p by this deterministic algorithm is the 
identity permutation. Characterize pop-stack sortable permutations in terms 
of pattern avoidance. 

29. Which permutations are sortable by ^parallel queues? That is, we have t 
queues, the entries can only enter them at the top, and leave at the 
bottom, but for each entry of the input, we are free to decide into which 
queue that entry will enter. Once an entry leaves its queue, it goes 
immediately into the output. 

30. Consider the following two modifications of the sorting algorithm of 
Exercise 11. 

(a) Sorting by t stacks in series. This is an undeterministic algorithm defined 
as follows. In any given step, move one entry from the top of a stack 
Si to the top of the next stack S M so that the new stack remains 
increasing from top to bottom. (Recall that S 0 is the input and S, +1 is 
the output.) In other words, omit the requirement that in each step 
we have to do this with the stack of the smallest index Tor which this 
is possible. 
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(b) Left-greedy sorting through t stacks. That is, in each step of this 
deterministic algorithm, we find the largest index i so that we can 
move the entry from the top of S, to S i+1 without violating any 
constraints. That is, the Si, S 2 , —, S, all have to remain increasing 
from top to bottom, and the output has to be a string 123 —m for 
some m. If no such move is possible, then we say that the algorithm 
fails. 

Prove that if t= 2, then the algorithms defined in (a) and (b) are equivalent, 
that is, they sort the same set of permutations. 


Problems Plus 

1. Prove that for all n and k, we have 


Wt / __ (n. + fc)!(2n - fc - 1)! _ 

21 ’ } (k + l)!(n — k)l(2k + l)!(2n — 2k — 1)! ’ 


(8.3) 


2. A j3(l, 0)-tree is a rooted plane tree whose vertices are labeled with positive 
integers according to the following rules. 

(a) The label of each leaf is 1, 

(b) the label of each internal node is at most the sum of the labels of its 
children, 

(c) the label of the root is the sum of the labels of its children. 

See Figure 8.7 for an example. 

Prove that the number of b{ 1, Octrees on n +1 vertices is equal to W 2 (n). 

3. Prove that there exists a simplicial complex A so that the set of (k-V)- 
dimensional faces of A is in bijection with the set of 2-stack sortable n- 
permutations having k ascents. 

4. Sorted permutations were defined in Exercise 18. Let p be a sorted 
permutation, and denote s l (p) the set of its preimages under the stack 
sorting operation. Prove that there is always a permutation q £ s~ 1 (p ) 
that has strictly more inversions than all other elements of P (p). 

5. Find an algorithm that decides whether a given permutation is sorted. 

6. Find a sufficient and necessary condition for permutations to be gd- 
obtainable. 
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FIGURE 8.7 

A P(l, 0)-tree. 

7. Recall the definition of sorting by t stacks in series from Exercise 30. Let 
t= 2. Characterize the set of permutations that are sortable by this 
procedure. We will call these permutations 2-series-sortable. 

8. Find a formula for the number of permutations of length n that are 2- 
series-sortable. 

9. Modify the sorting procedure of Exercise 11 by making it undeterministic 
as follows. In any given step, we can either 

(a) put the next element of the input into any stack S„ for any i £ or 

(b) put the entire content of stack Sj into the output for some j £ [(] 
(without changing the order of the entries). 

Note that the stacks S\ are in fact pop-stacks in this model. This collection 
of pop-stacks is referred to as tpop-stacks in parallel. Let us call a permutation 
p sortable by t pop-stacks in parallel if there is a sequence of steps of the 
above two kinds that turns p into the identity permutation. 

Is it true that the set of all permutations that are sortable by / pops tacks in 
parallel forms a closed class? 

10. Let f(n) be the number of all w-permutations that are sortable by 2 pop- 
stacks in parallel. Prove that f(n)=n\ if 3, and 

f(nhWn-l)-mn-2)+PJWZ) 


if tz>3. 
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11. In part (b) of Exercise 30, we defined the left-greedy algorithm. Define 
the right-greedy algorithm in an analogous way. Let f>l . Is it true that if the 
permutation^ is sortable by the right-greedy algorithm on ^stacks, then it 
is also sortable by the left-greedy algorithm on t stacks? 


Solutions to Problems Plus 

1. We have mendoned in the text that 2-stack sortable w-permutations are in 
bijection with nonseparable rooted planar maps on n +1 edges. That 
bijection [77] turns the number of descents of the permutation into the 
number of vertices of the map, and turns the number of ascents of the 
permutation into the number of faces of the map. However, rooted 
nonseparable planar maps have been enumerated according to their 
number of vertices and faces in [52], and have been found to be counted 
by the expression on the right-hand side of (8.3). An alternative solution 
will be mentioned in the solution of the next Problem Plus. 

2. See [65] for a short proof. Also note that f}( 1, 0)-trees on n +1 vertices 
having k leaves are in bijection with 2-stack sortable n-permutations having 
k descents. As the former have been enumerated in [64] and have been 
shown to be counted by (8.3), this provides an alternative proof for the 
previous Problem Plus. 

3. This result was published in [38]; see that paper for the details that are 

omitted here. Denote by the set of all /}(1, Octrees on n +1 nodes 

having k internal nodes. Our plan is as follows. To each /J(l, 0)-tree 
T G we will associate a Ttuple (T U T 2 , ■ ■ ■ ,T k ) € [D^fJ} k of 

/j(l, 0)-trees, in an injective way. By the solution of the previous Problem 
Plus, this is equivalent to the statement to be proved. 

First, we specify the order in which we will treat the k internal nodes of T. 
To that end, we extend the notion of postorder from binary plane trees 
to plane trees the obvious way. That is, for each node, read its subtrees 
from left to right, then the node itself, and do this recursively for all 
nodes. This rule linearly orders all nodes of % and in particular, turns our 
set { Ij,-", Vi) of internal nodes into the T tuple (Ij,—, V t ) of internal nodes. 

Let i € [k] and let V,hc the zth internal node of our j3(l, 0)-tree T. Let V L 
have d t descendents, excluding itself. Denote by l the number of nodes of 
T that precede V, in the postorder reading of T. Similarly, denote by r, the 
number of nodes of T that follow V in the postorder reading of T. 
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FIGURE 8.8 

Decomposing a /3(1, 0)-tree . 

Define T, as the unique (id, 0)-tree with one internal node Z so that Z 
has di descendents, and the root of 7” has l\ leaf-children on the left of Z 
and r, leaf-children on the right of Z- The only node whose label has to be 
defined is the only internal node Z> and we set label(Zi ) = label-r(V). 

We show that we can indeed always set labelx: ( Zi ) = labelj (Vi), that is, 
labelf(V) is never too big for the label of Z- Indeed, Z has d children, all 
leaves, so any positive integer at most as large as d, is a valid choice for 
the label of Z- On the other hand, fjhas d t descendents in % so labebr(V)< 
d{, and therefore labelf(Vj is a valid choice for labelT^Zi)- Our 
decomposition map h is then defined by h(7)=(T h Z,"-, jQ. See Figure 
8.8 for an example of this map. 

One can then show by induction on k that the map h : D » 
{D'n'+i i J defined by h(T)=(T h T 2 ,—, T/) is an injection. 

Note that h is not a surjecdon. Indeed, for (2j, Tj,---, Tj) to have a preimage, 
we must have This also shows that 4-tuples of j3(l, 0)-trees 

with one internal node, and 4-element sets of /?(1, 0)-trees with one internal 
node are equivalent for our purposes. 

Finally, we show that the defining property of simplicial complexes 
holds for our construction. That is, we show that if there exists a /?(1, 0) 
-tree T so that h(T)={T 1 , Z t ), then for any subset 
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FIGURE 8.9 

Removing an internal node. 

7 = ‘ C [k], there exists a /J(l, 0)-tree Tj so that 

h{ Tj) {7/, 7/, • ■ *, 7y. 

It is clear that all integers hi, h 2 * * * h, are different as event the integers 
4, 4> ■■■) 4 are all different. Relabel the elements of 7so that they are in 
increasing order. This gives us a/tuple (7),, 7/, • ■ •, 7/) of /J(l, 0)-trees 
with one internal node. 

Assume first that /has A-1 elements, with z being the missing element. 
Then we construct 7/ from 7 as follows. Let F[be the ith internal node of 
7. Remove V t from 7, and connect all its children to the father F t of V„ 
preserving their left-to-right order. Say the rightmost child of V t was 7l r 
Now add V back to the tree so that it is a child of F, and it immediately 
follows Ri in the left-to-right list of the children of F r Note that now V, 
became a sibling of one of its former children, and it became a leaf. 
Therefore, we have to change the label of V, to 1, but we do not have to 
change any other labels. Indeed, there is no other node who lost 
descendents in this operation. (The only exception is when Vi was a child 
of the root. In that case, we may have to change the label of the root, but 
that will not cause any problems.) Call the j8(l, 0)-tree obtained this way 
T. See Figure 8.9 for an example of this procedure. 

It is straightforward to verify that h(Tj) = (7/ , Tj 2 , • - • , Ti h _ 1 ). If /has 
less than AT elements, then we construct 7} by iterating this procedure. 

4. This result was proved by M.Bousquet-Melou, and can be found in [45]. 
She defines two operations on decreasing binary trees that increase the 
number of inversions in the corresponding permutation, but do not change 
the postorder reading of the tree. One operation concern vertices that 
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FIGURE 8.10 

Finding the canonical preimage of 21346578. 

have a left subtree only, and turns that subtree into a right subtree. The 
second operation concerns vertices v that have a left child a and a nonempty 
right subtree T so that the leftmost vertex y of T is larger than a. Then the 
operation takes the subtree rooted at a and moves it so that it is the right 
subtree of y. See Figure 8.10 for an example. 

The author then proves by induction on the size of the trees that these 
operations will always lead to a unique tree She calls that permutation 
corresponding to that tree a canonical permutation. Note that neither of 
the two operations defined above can be carried out on T can , which shows 
some properties of this tree. 

5. A recursive algorithm to decide this question was developed by M.Bousquet- 
Melou in [44], She uses a decomposition first mentioned in [207]. Let p be 
an w-permutation that has k-1 descents, or, in other words, k ascending 
runs. Let these ascending runs be r h r 2 ,—, r k . We have seen in the text that 
taking the stack sorted image of a permutation q is nothing else but taking 
the decreasing binary tree T(q) of q and then reading it in postorder. Now 
assume s(q)=p. Then any time we pass in p from one ascending run to the 
next, in the postorder reading of T q , we must jump from the left subtree of 
a vertex to the right subtree of a vertex as that is the only way we can 
obtain a descent. This observation justifies the following algorithm. 
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FIGURE 8.11 

Finding a preimage for j!?=213547689. 

To obtain a permutation q so that s(q)=p, take the ascending run 
decomposition r h r 2 ,--, r k of p, and for each i, take the decreasing binary 
tree that is corresponding to the reverse of r t . Denote these trees by T„ for 
i e [k\ Attach a left-half edge to the top of these trees. Now try to build 
up q right to left as follows. In Step 1, find the smallest element in r k that 
is larger than the largest element in r hA , and attach l' t .\ to that vertex as a 
left subtree. If there is no such element (which would mean xhaip did not 
end in n), then p is not sorted. Otherwise, continue this way. In step i, find 
the smallest element in the leftmost path of the tree already created that 
has no left child and is larger than the largest element of T k . v If there is no 
such element, then the previous paragraph implies that p is not sorted. 
Otherwise, we continue this procedure till an w-vertex tree is created. In 
that case, p is sorted as p=s(q), where the permutation corresponding to 
created tree is q. Note that the permutation q will be the canonical preimage 
of p. See Figure 8.11 for an example. 

6. This result was proved in [157] in a slighdy different form. The set of 
minimal non-gd-sortable permutations is ^=52341, « 2 = 5274163, a$=l 2 
9 4 1 6 3 8 5, 5, <z 4 =9 211416385 10 7, and so on, and all 
permutations that can be obtained from the h, by interchanging the two 
maximal entries, and/or interchanging the first two entries. Note in 
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particular that in the ai, the even entries are fixed points and the odd 
entries are cyclically translated (except in a±). 

7. It is proved in [12] that these are the permutations that avoid all patterns 
of the form ^„=2(2m-l)416385"-2m(2m-3), for each m>2. In other words, 
in position 2z-l, the pattern q m contains the entry 2z, and in position 2z the 
pattern q m contains the entry 2z-3 if z>l, and the entry 2??z-l is in the 
second position of q m . 

8. It is proved in [12] that the number A(n) of these permutations is equal to 
S, ,(1342). We have proved an exact formula for S„(1342) in Theorem 
4.31. 

9. Yes. In fact, more is true. It is proved in [11] that for any t, there exists a 
finite set F, of permutation patterns so that p is sortable by ^pop-stacks in 
parallel if and only if p avoids all patterns in F t . So the class of these 
permutation is not just an ideal, but an ideal generated by a finite number 
of elements. The proof is constructive, so the elements of F t can actually 
be found. For 1—2, there are seven of them. 

10. This result is due to M.Atkinson [11]. 

11. Yes, that is true. See [173] for a proof. 
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Solutions for Chapter 1 

1. We have 


a( 5) = f n V n ' si )f n_S2 V--f ri_s ^ 

\SlJ \S 2 - Si/ \ s 3 - S 2 ) \n - SkJ 

_ n\ _ (n - Si)! _ (n - s k )l 

si!(n — Si)! (s 2 - si)!(n - s 2 )! (n - s k )l 

nl ( n 


si! • (s 2 - si)!.(n - s k )l \si,s 2 - si, • ■ • ,n - s n 


3. If p^ipi—p„ has k descents, then its complement p clearly has n-\-k descents. 
Here p is the w-permutadon defined by (p') t = n+l-p t . 

5. Look at the sequence where b=a/a iA . Then {a,}, is logconcave if and 
only if {b;}j is weakly decreasing, while {«,}, is unimodal if and only if once 
{£,}, gets to a number that is not larger than 1, it never grows back above 1. 
As this second condition on {£,}, is obviously weaker than the first, the 
statement is proved. 

7. We prove the statement by induction on n. Our claim is true for n=() as 
x° =1, and ?z=l as x = 0 ■ + 1 * (“). Now assume that we know 

the statement for n. Multiply both sides of (1.2) by x to get 


x n+i^j2A(n,k)f X + n ~ k )x 

k =0 \ J 


( 1 ) 


Now note that 


x + n — k\ , fx + n + l — k 
}x = k[ 

n J V n +1 


+ (n - k + 1) 


x + n — k 
n + 1 


( 2 ) 
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Therefore, the right hand side of (1) equation becomes 


A(n, k)k 


k =0 


X + n + 1 — k 
n +1 


+ A(n, k)(n — k + 1) 


x + n — k 
n + 1 


fc=o 


x + n + l — k\ 

n + 1 ) 


A(n, k) + (n — k — 1) 

x + n + 1 — k 


x + n — k + 1 
n + 1 


A(n, k — 1) 


A(n + 1, k) 


n A 1 


where the last step uses the result of Theorem 1.7, with k-\ playing the role 
of k. Comparing this to the left-hand side of (1) proves the statement. 

9. The crucial observation is that the sum on the right-hand side of the equation 
to be proved has n-k +2 terms. This suggests that we compare it to what we 
get if we apply Corollary 1.18 to compute A(n, n-k+ 1)= A(n, k). (The 
summation 1.18 would have only n-k +1 terms, but an additional term in 
which z=0 can be added to it without changing its value.) That Corollary 
gives 


n —&+1 / \ 

A(„,„-i + l)= £ 5(«,r)r! „ + ":^ r (-I)—*- 

r=1 ' ' 

n—k+l / \ 

= £ S(n,r)r\( n ~ r )(-ir+ 1 - k -\ 

r =1 ' ' 

which, after the substitution h—n-r, is just what was to be proved. 

11. This is a classic result due to Frobenius [98]. Compare the coefficients of x‘ 
on the two sides. On the left-hand side, it is A(n, i). On the right-hand side, 
it is 


£ { n _l)s(n,k)kl(-l ) n ~ k ~ i+1 

k =0 ' 


Setting h=n-k, this is precisely the result of Exercise 9. 

13. For each permutation p € S n , let f(p) be the reverse complement of p, that 
is, the permutation whose i th entry is n+l-p n+] _ t . This sets up a bijection from 
the set of excedances of p onto the set of weak excedances of (p‘) r that are less 
than n. Indeed, if z is an excedance of p, then z>l, so n+l-i<tn. On the other 
hand, n is always a weak excedance of any permutation, so in particular, of 
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(p')'. This proves that the number of w-permutations with k -1 excedances is 
the same as that of w-permutations with k weak excedances. Our statement 
is then proved by Theorem 1.35. 


15. (a) Set E 0 =E 1 =1. Construct an alternating permutation of length n +1 as 
follows. Place the entry n +1 in any of the n +1 possible positions, say in 
position k+1. Then there are (J) ways to choose the k entries that will 
precede the entry n+ 1. 

Call a permutation reverse alternating!! its descent set is (2, 4, •■•}. It is clear 
that the number of reverse alternating w-permutations is E n as the 
complement of an alternating permutation is reverse alternating and 
vice versa. 

If k+1 is odd, then we can simply create an alternating permutation on 
the k entries that precede n in E k ways. Then we can take a reverse 
alternating permutation on the n-k entries that the entry n +1 precedes 
in E n _ k ways. The permutation obtained this way is always alternating. So 
the total number of choices here is (”) IS/c-E’n-fc- 

If k+1 is even, then everything is the same, with the only exception that 
the substring preceding n+1 is reverse alternating, and the substring 
succeeding n+1 is alternating. The permutation we obtain this way is 
always reverse alternating. 

As each alternating and reverse alternating permutation is obtained this 
way, this yields 


for 1. 
(b) We have 


2-Ejj+i — 



EkEn—k 


n 

e (x f = Y,J2 EiEn ~* 

n>0 i=0 


X 


n 


il(n — i)! 


On the other hand, 


E'(x) = Y, 


E n +i 


n> 0 


Comparing these two equations to the recursive formula obtained in 
part a, we get E(x) 2 =2E’(x)-l. Solving this differential equation yields 

E(x)=tan x+sec x. 

We note that the numbers E„ are called the Euler numbers, not to be 
confused with the Eulerian numbers. Moreover, the numbers E 2n are 
often called the secant numbers and the numbers E 2n+1 are often called the 
tangent numbers. 
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17. Let us insert the entry n into a permutation of length n -1 with k-1 distinct r- 
descents so that the number of r-descents does not change. To do this, we 
can do one of three things; either we insert n between two entries p, and p l+1 
so that p;>pj+i+r, or we insert n in the last position, or we insert n so that it 
immediately precedes one of the r -1 entries that are larger than n-r. Altogether, 
this will give us [k-l+l+r-1) A(n- 1, k, r)={k+r- 1) A(rt 1, k, r) permutations 
enumerated by A (n, k, r) in which n is not part of any r-descent. 

Now let us insert entry n into a permutation of length n -1 with k-1 distinct r- 
descents so that the number of r-descents increases by one. We know from 
the previous paragraph that there are k +r-1 possible insertions of n into 
each permutation enumerated by A(n-1, k-1, r) that do not increase the number 
of r-descents. Therefore, there are n-k-r+1 that do, providing us with (n-k- 
r+2) A(n-1, k-1, r) permutations enumerated by A (n, k, r) in which wis part of 
an r-descent. Summing for the two cases, we complete the proof. 

19. No, they are not the same unless r=\=l. Indeed, just by looking at the 
definitions of these numbers, we have Sfc=i r) = n! for any fixed r, 

but we also have Yf'k=i T((n, k, l) — nl l for any fixed /. 

21. Assume first that x is a positive integer. Then the left hand side is just the 
number of ways to color the elements of [n] so that the color of each element 
is chosen independently from the set [xj. The righthand side is the same, 
counted by the number of colors actually used. Indeed, if m colors are used, 
then they define a partition of [n] into m blocks in S(n, m) ways. Then the 
colors of the blocks can be chosen in x(x-l)—(x-m+l)=(x) m ways. This proves 
the claim if x is a positive integer. 

Otherwise, note that both sides of 21 are polynomials in x. They agree 
for infinitely many values of x (the positive integers), so they must be 
identical. 

23. No, that is false. Indeed, let P(x)= 1+X+3X 2 and let Q[x)=l+x+4x?. Then 
P( x )Qi x )= l+2x+8x 2 +7jc 3 +12:d fails to be unimodal. 

25. (a) Assume that p has two alternating runs and starts with an ascent. Such 
permutations increase on the left of n and decrease on the right of n. 
Therefore, we can choose the set of entries that precede n in 2”' 1 -2 ways, 
and each such choice will correspond to one permutation. As pt has the 
same number of alternating runs as p, this proves G(n, 2)=2 (2” _1 -2) =2”- 
4, ifw>2. 

(b) One way to find a formula for G(n, 3) is by using Lemma 1.37 with ^==3, 
and the above result for G{n, 2), to get that for 3, we have 

G{n, 3) = 3 G(n -1,3)+ ( 2 n - 8) + 2 (n - 3), 
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G(n, 3)-3G(n-l, 3)=2”+2n-14, 


and then by solving this recurrence using the ordinary generating function 
of the sequence G{n, 3). 

To do that, let G(x) = ]T n > 3 G(n, 2>)x n . Then the previous equation 
leads to 


G(*)(l -3x) = 


8a; 3 2x 

1 - 2x (1 — x ) 2 


— 4a; 2 - 2a: — 


14a; 3 

1 — x' 


_ 8a: 3 2x 

G{x) - (1 _ 2a ,)(i_ 3a .) 4 (i _ *)»(1 _ 3ar) 

/ 4a: 2 + 2a; 14a; 3 \ 

l 1 — 3a; (1 — z)(l — 3a;)/ 


Then we get G(n, 3) as the coefficient of x n on the right-hand side, 
that is, 


3” +1 _ 2n— 3 

G(n, 3) = 8(3 n “ 2 - 2” _2 )+-—-10 • 3 n “ 2 - 7(3”" 2 -l) 


3 n + 11 


>n+1 


for all 3. 


27. If n is odd, then the equivalent of Proposition 1.43 is a bit more cumbersome. 
We again use symmetry by taking complements, but instead of assuming 
pi < ip 2 , we assume that p 2 <p 3 - Taking G„, m (x) then adds the restrictions p^ps, 
pf<lp 7 , p n -\Gp n . Then it is straightforward from the definition of tjp) that 
t m (p)=d(p), with the convention that the singleton^ has 0 runs. 

Therefore, if n is odd, then we have 


G n ,m{x) = Y. = E xd(P) ■ 

pes n p€S n 

P2<?Z P2<P$ 

Then we can repeat the argument of Lemma 1.42. Indeed, the coefficient of 
x 4 in G n ,m(x) equals the cardinality of V (n , k), the subset of V (n, k) in which 
the edges a 3 , « 5 , ■ ■ ■, a 7 are horizontal. And the fact that the \ V{n, k)\ are log- 
concave can be proved exactly as the corresponding statement for the 
|V(n,k)| = U(n, k), that is, by taking the relevant restriction of <1>. 

29. We prove the statement by induction on k, the initial case of WO being 
trivially true. Now assume the statement is true for k, and prove it for k+1. 
We know from Theorem 1.7 that 


A(n, k+\)-(k+\)A(n-\, k+\) = (n-k) A(n- 1, k). 
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9 



FIGURE 9.12 

The decreasing binary tree of 263498175. 


Here the right-hand side satisfies a polynomial recursion by the induction 
hypothesis, so the left-hand side will satisfy that same polynomial recursion. 
Rearranging that recursion, we get a recursion for A(n, k). 

31. For all p € S n , define the decreasing binary tree T(p) as follows. The root of 
T(p) is n, and the left (resp. right) child of wis the largest entry oip on the left 
(resp. right) of n. Then define the rest of the tree recursively. See Figure 9.12 
for an example. 

It is clear that T is a Injection from S„ to the set of all decreasing binary trees. 
Indeed the unique preimage of a decreasing binary tree can be read off the 
tree in order, that is, for each node, read the left subtree first, then the node 
itself, and then the right subtree. 

33. If a permutation^ has exactly k peaks then there are two possibilites. 

(a) When^!<^. In this case, 'ri.p^ft>p n , then p has 2k alternating runs, and if 
pn-i^pn, then p has 2A+1 alternating runs. 

(b) When pft>ph- In this case, iip nA ~>p n , then/> has 2k+\ alternating runs, and 
ri p n i<t.p n , then/> has 2k+2 alternating runs. 

In other words, the set of w-permutations with k peaks consists of the entire 
set of w-permutations with 2k+l alternating runs, half of the ^permutations 
with 2 k alternating runs, and half of the w-permutations with 2A+2 alternating 
runs. This proves 


Peak{n, k) = G(n, 2k + 1) + 


G(n,2k) +G(n,2k + 2) 
2 


35. It is easy to prove, by induction, or otherwise, that the number of descents of 
p is equal to the number of right edges of T(p), while the number of ascents of 
p is equal to the number of left edges of T(p). Now it is clear that the symmetry 
of the sequence A (n, k) k can be proved by the simple bijection that reflects 
T(p) through a vertical axis. 
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Proving unimodality is a more interesting task. Let The a decreasing binary 
tree on n nodes. We define a total order of the nodes of T as follows. A node 
v of 7’is on level j of T if the distance of v from the root of T is j. Then our 
total order consists of listing the nodes on the highest level of T going from 
left to right, then the nodes on the second highest level left to right, and so 
on, ending with the root of T. 

Let 7” be the subgraph of 7 induced by the first i vertices in this total order. 
Then 7jis either a tree or a forest with at least two components. In the first 
case, let g(T) be the reflected image of 7’ through a vertical axis. In the 
second case, if the components of 7j are Cj, C 2 , —, C„ then let g(TJ=(g(Ci), 
g(C2 ), g(C 2 )). Now prove that if k<[(n- 3)/2], then zcan always be chosen so 
lhat /fT’j has exactly one more right edge than 7”. Then an injection from 
the set of decreasing binary trees on n vertices with k right edges into the set 
of decreasing binary trees on n vertices with k+ 1 right edges can be defined 
by finding the smallest such i, then replacing 7” by g(T) is 7! 

37. (a) The first proof of this fact is due to G.Hetyei and E.Reiner, who used 
exponential generating functions and partial differential equations to get 
this result. The proof we present is combinatorial. For any i < n-1 , either 
pi is an ancestor of p M , or p M is an ancestor of pi, or else pi and p i+1 
would have a common ancestor, which would put an index between i and 
i+ 1. In particular, T™ cannot have both pi and pi+1 as leaves. This 
makes the following definition meaningful. Let i G [n — 2], Then the ith 
local extremum of a permutation p is the entry that is closest to the root of 
the minmax tree of p among p,, p i+u and p M - Denote this entry by e t . 
Note that <?, always exists. 

Assume by symmetry that the entry 1 of p precedes the entry n of p. 
We can do that as the minmax tree of ft is isomorphic to T"\ First we 
are going to prove that p l is a leaf in d/3 minmax trees. The simplest 
scenario is when the entry 1 is among the leftmost three entries, so in 
particular, q=l. This gives rise to three subcases: 

• If/>i=l, then/i! is the root of the minmax tree. 

• If p 2 = 1, then p 2 is the root, and pi is a leaf. 

• If p 3 =l, then p 3 is the root, its left subtree has pi and p 2 as nodes, 
and among these, by definition, p^ is a leaf, and pi is not. 

Clearly, these cases are equally likely to occur, so pi will be a leaf with 
probability 1/3 in this case. 

Now suppose that the entry 1 of the permutation is not among the first 
three elements. This entry is the root of and its left subtree has at least 
three nodes. Let these nodes be bi<b 2 <—<b k . 
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Repeat the previous argument for this subtree, with b\ playing the role 
of 1; now if b\ is among the leftmost three elements of the permutation, 
pi is a leaf with probability 1/3. Iterate this algorithm. It will eventually 
stop because we either get a left subtree of size three, or a subtree 
whose minimal entry is among the first three ones. This proves that 
there are d/3 minmax trees on [n] in which pi is a leaf. 

The proof for general p, is similar. Assume again that 1 is on the left of 
n in p, and let i G [n — 2]. If 1 £ {Pi,Pi+i,Pi+‘ 2 \, then there are three 
possibilities. 

• If p= 1, then pi is the root of the minmax tree. 

• It p M =l, then p M is the root, so p, is the rightmost element of its 
left subtree, and as such, it is necessarily a leaf. 

• If pn. 2=1, then pi +2 is the root, p, is the next-to-last element of the 
root’s left subtree, and as such, it is always an internal node (having 
the leaf p,+i for its only child). 

Again each of these subcases occurs with probability 1/3. 

If 1 ^ {PiiPi+uPi+ 2 }, then we can proceed as above. That is, look for 
the entry 1 of p, then only consider the subtree that contains the positions 
i, z+1 and z+2. If 1 is not in any of these positions, then all three of them 
are in the same subtree. Iterating this algorithm we eventually reach a 
subtree where we can apply the above method. The structures of the 
other subtrees do not influence whether p, is a leaf or not, so p, is a leaf 
with probability 1/3. This completes the proof. 

(b) Clearly, p„ is always a leaf because it cannot be the leftmost in any 
comparison, thus it cannot have descendants. Similarly, p„ is always the 
child of pK 1 , thus p nA is never a leaf. 

39. Any lattice path included in our sum must end either in a horizontal step 
having label k or a vertical step having label n-k+1. Then use induction and 
Theorem 1.7. 

41. Similar to the solution of Exercise 39, just this time we have to use the 
recurrence proved in Exercise 17. 


Solutions for Chapter 2 

1. There are two known solutions of these results. One [195] is by a 
combinatorial involution, and the other [71] is by a generating function 
argument. These solutions generalize in different directions. 
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FIGURE 9.13 

The six trees with no inversions. 


3. As we discussed, b(n, 3) is equal to the coefficient of x 3 in the polynomial 

I n (x)=( 1+x)(1+x+x 2 )-"(1+xH -bx”' 1 ); in other words, b{n, 3) is the number of 

weak compositions of 3 into n -1 parts, so that the first part is at most 1, and 
the second part is at most 2. 

There are ("t 1 ) weak compositions of 3 into n -1 parts. One has second part 
3, one has first part 3, and n -2 of them have first part 2. Therefore, 
b(n,3) = ("+ 1 ) -n. 

5. An elegant proof of this result is due to V.Strehl, and can be found in [185]. 

7. The number of such trees with zero inversions is n\. Figure 9.13 shows the 
six trees with root 0, non-root vertex set [3], and no inversions. To prove 
this statement bijectively, let p be an n-permutation, and define H(p) to be 
the tree on vertex set [n] U 0 defined as follows. If i is an entry otp, then the 

unique parent of z in H(p) is the vertex j, where jis the closest entry in p that 
is on the left of i, and that is smaller than i. If there is no such entry j, then 
the parent of zis 0. The reader is invited to verify that H is indeed a Injection. 

9. The following proof was given by D.Bressoud and D.Zeilberger in [50]. Let 
A=(A 1 , A 2 ,-", A,) be a partition of n-a(j), for some even j. 
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Define r/>(A) by 


m = 


i 


(t + 3 j — 1, Ai — 1, A2 — 1, • ■ ■ Af — 1 ), if t + 3 j > Ai 


(A 2 + 1, ■ ■ • A t + 1,1,1, • • •, 1) otherwise 


If the second rule is used, then there are A r 3/-/-1 parts equal to 1 added at 
the end. 

Note that the first rule sends A into a partition of n-a(/)+3j=n- a(j- 1), and the 
second rule sends A into a partition of n-a(j)-3j-2= n-a(j+ 1). So in both cases, 
cf> maps from into Uj even Par(n — a(j)). Finally, note that applying twice is 
the identity, therefore cp has an inverse, and must be a bijection. 

Another solution was given by Garsia and Milne in [102]. 

11 . Yes. We prove the statement by induction on k, the initial case of WO being 
trivial. Suppose we know that the statement is true for k-l. Lemma 2.5 tells 
us that b{n+ 1, k)-b(n, k)=b{n+ 1, k-l) if k<n. From this, it is easy to see that 
b(n, k) is also/^recursive. Finally, we mention that the constraint k<n'\s not a 
reason for concern. Indeed, it is obvious that if f and g are polynomially 
recursive functions, then so is f+g. On the other hand, for any fixed k, the 
difference of the functions b(n+ 1 , k-l) and b(n+ 1 , k)-b(n, k) is nonzero for a 
finite number of values of n only, and is therefore certainly/^recursive. 

13. There are b(n, k) such w-tuples. Indeed, let p=p]pr"p, l be a permutation, and 
define b(p)ibc the number of indices j<i so that p, < <p r Then clearly 0<b(p)i< 
z-1, and the map b:S n -^B n defined by b(pip 2 ,---, pn)=(b(p )\, b(p) 2 ,■■■, b(p)n ) is a 
bijection because the only preimage of (A, b 2 ,—, b„) can be built up from left 
to right. The value of ^ reveals the relative size of p, among the first i entries 
of p. Finally, note that Xn_i b(P)i = *(p) as b(p),\s the number ofinversions 
whose first element is p r Therefore, the set of permutations with k inversions 
is mapped to the subset of B„ in which X 4 L 1 ^ = 

We note that (b(p) 1 , b(p) 2 b(p)P), or a trivial transformation thereof is often 
called the inversion table ofp. 

15. In an n-permutation with k inversions, the entry n can be part of /inversions, 
with so 


k 


b(n, k) = ^ b(n - 1, k - i). 


17. Such a permutation is either alternating, or reverse alternating, but in both 
cases, it has k descents and k ascents. Therefore, it has to have at least k 
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inversions and at least k noninversions. These are both attainable, with the 
permutations 13254--(2A+1)(2A) and its reverse. 

19. The main idea of this solution can be found in [91]. Just as in the solution of 
Exercise 13, we are bijectively encoding a permutation by an n-tuple (d h d h 
■■■, d n ) of nonnegative integers satisfying d h <k T. This will prove that dm is 
Mahonian. We define the d h as follows. Let p=pip 2 —pn be an w-permutation. 
For fixed k, let 

\{l < k so that pk < pi < k}\ if p k < k, 

j{f < k so that pi < k} \ + \{l < k so that p k < Pi}\ 
if p k > k. 

It is easy to check that ^" =i rljp) — den(p). All we have to show is that p 
can be recovered from its Derwrt-table, that is, from the w-tuple (<4 (p), d^ip),---, 
d n (p )). Indeed, p rl =n-d n . Now last n-k elements of p have already been recovered. 
Then recover p k as follows. Look at the list k, k-1, ■■■, 1, n, n- 1, —, k+1, and 
delete all the entries that have already been assigned to a position in p. Then 
p k has to be chosen so that there are exactly d k (p) entries on its left. 

21. Let us multiply both sides of the identity by [(n-k)]! [k] ! to get 

[n] !=[nT]![k]^”"N-1- [n-1]! [n-k] 

Now let us divide both sides by [w-1]! to obtain 
[n]=[k]^+[n-k], 

which follows direcdy from the definition of [m]. 

23. This can be proved by repeated applications of the result of Exercise 21, but 
we prefer a combinatorial argument. The left-hand side provides the generating 
function for all Tsubsets of [m] according to their subset sums. 

A typical term of the right-hand side is of the form q m ~ k -}+ 1 j, where 
j £ [m — k\. We claim that this term will provide the generating function for 
those Tsubsets of [m] whose largest element is equal to m-j+1. Indeed, the 
rest of such a subset is a (Tl)-subset of the set [m-j]. The term q m -J - k+1 corrects 
the shift in the definitions of a, and c, as seen in the proof of Theorem 2.25. 

25. We claim that there is a bijection between the Ferrers shapes consisting of 
n boxes that lit within an ixk rectangle and Tsubsets of [i+k] whose sum 
of elements is n + (^T 1 ). Indeed, let The the Ferrers shape of a partition 
of n into at most i parts of size at most k. Let the row lengths of F be (/[, 


dk = <4(p) = < 
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f 2 ,-"ifk) in nonincreasing order, where some /, may be equal to 0. Then 
f k +Y\ is a Telement subset of [i+k], and the sum of its 
elements is n + ( f 1 ). As this map is obviously a bijection, our statement 
is proved by Theorem 2.25. 

27. See [203] for a bijective proof of this fact. 

29. If n is odd, then we recall that is the number of ^-dimensional subspaces of 
an w-dimensional vector space over a ^-element field. Matching each such 
subspace with its orthogonal complement, we get that 2^ =0 (—l)* = 0- 

If ?z=2 m is even, then we claim that 

I> 1)fc [k] = (! - ?)(! - 9 3 ) • • • (1 - g 2m_1 ). 

fc=o 


A computational proof of this fact can be found in [6]. Recently, a more 
combinatorial proof was given in [68]. 


Solutions for Chapter 3 

1. Interchanging the first two elements of a permutation p either increases 
or decreases the number of inversions of p. In either case, it changes the 
parity of p. So exactly half of all permutations are odd, and half are 
even. 

3. Let (a, b) be an inversion of p. That means that looking atp as a function 
from [n] to [n], we have p(i)=a and p(j)=b, with a>b and i<j. Then, by the 
definition of the inverse, we also have p 1 (a)=i and p 1 (b)=j, proving that 
(j, i) is an inversion of/d. This sets up a bijection between the inversions 
of p and p 1 . 

5 . Let f(p)=l if p is even, and \etf(p)=-\ if p is odd. In other words, f(p)=det 
A p . Note that this map yds often called sign. 

7. We claim that f must be the identity permutation. Assume not, then f 
contains a A-cycle (fifvfk), with £>1. Assume first that k>2. Let 

Indeed, gM/[))=t// 2 )=A, ^^fgifi)=M)=/ 3 - If, on the other 
hand^/has no cycles longer than two, (that is, k=2), then let g=(/ 2 ^), 
where /q is any element outside the cycle (f\f 9 ). Then as;ain Indeed, 

whereasy(gj/]))=y{/])=^. 
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9. Such a permutation has a unique longest cycle C, of length k. We have 
(”) (k — 1)! choices for this cycle, then (n-k)\ choices for the rest of the 
permutation. Therefore, our total number of choices is 

-*)■=£. 

11. As each permutation is a product of its cycles, it suffices to prove our 
statement for permutations that consist of one cycle only. This is not 
difficult as (12-A)=(12)(23)(34)-(M k). 

13. We claim that 3 is the only such value. Indeed, S 2 has two conjugacy 
classes of size 1. If «>5, then S„ has two conjugacy classes consisting of 
4 , elements. These are those that belong to types (2,1, 0, 0,—, 0, 1, 
0, 0, 0) and (0, 0, 0, 1,—, 0, 1, 0, 0, 0). For iv= 4, these classes both have 
six elements. 

We point out that much more is known. F.Markel conjectured in 1973 
that S 3 was the only finite solvable group that had no conjugacy classes of 
the same size. This conjecture stayed open for twenty years, and was 
then proved independently in [134] and [208]. However, it is not known 
at this time whether the condition that the group be solvable can be 
dropped. 

15. Multiplying both sides by /,- from the right, we get that our formula is 
equivalent to 


Compute where each sides takes the node k. For the left-hand side, we 
get 

f T C T (k) = C T (f T {k)) = CriCf^Hl) = C*( 1) = f T (k + 1). 

For the right-hand side, we get 

(123 -w)/ r(k)=/ r{(123 —re) (k))=fi(k+l). 


17. Clearly, c(n, l)=(w-l)! as such permutations have type (0, 0, - - -, 0, 1). We 
have c(n, n — 1) = (”) as all such permutations must have type (n- 2, 
1, O,--, 0), and we have (") possibilities for the single 2-cycle. 

19. We claim that 


S( - n ’ k ' ) = ^ E 


n+r 2 -|-hi-fe—n 


ri!r 2 ! ■ • -r fc ! 


where the r, are positive integers. Indeed, order the elements of [n] in 
one of n\ ways, then insert a bar after the first rj elements, then the next 
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r 2 elements, and so on. This provides a partition counted by S(n, k). 
However, each such partition will be obtained in k\-ri\r 2 \—r t \ ways, as the 
order of the elements within each block does not matter, and the order 
of the blocks does not matter. 

21. This approach to Stirling numbers was studied in [129]. Both polynomials 
have degree 2k and leading coefficient ^ 2 ^ 2 ~^ !! . This is straightforward 
to prove by induction, using the recurrence relations given in Exercise 8 
of Chapter 1 and Lemma 3.19. 

23. Let the entry n+ 1 be part of an (w-m+l)-cycle C. Then we have { ” m ) 
ways to choose the n-m remaining elements of C, and we have (w-m)! 
ways to choose C on these elements. Then, we have c(n, m) ways to 
choose the rest of the permutation. Summing over m, we get the identity 
to be proved. 

25. We claim that 

c 3 (n, k)=(n-l)c 3 (n-l, k) + (n-l) (n-2) C 3 (w-3, k-1), 

with <g(0, 0)=1. Indeed, in a permutation enumerated by £3 (n, k), the entry 
wis either in a 3-cycle, or in a larger cycle. There are n- 1 permitted ways to 
insert n into a gap position of a permutation counted by q (n- 1 , k) as the 
last gap position is forbidden (it would create a 1-cycle). The permutations 
obtained this way will contain n in a cycle longer than three. 

Otherwise, there are f"” 1 ) ways to choose two entries that can share a 
3-cycle with n, then there are two possible 3-cycles involving n and the 
two chosen elements. 

27. \jt\.p(k, n) be the probability that we draw k white balls in n trials. That 
event can occur in two different ways, either we get the k-1 white balls 
during the first n- 1 trials, or we get only k white balls during the first n- 
1 trials, and the last one during the last trial. This leads to the recurrence 
relation 

/, x k ,, m — k + l.- , 

p(k,n ) = — p(k,n — 1) + -- plk — l.n — 1). 

TO TO 

Indeed, if the first a trials resulted in drawing b white balls, then there 
are m-b white balls, and b black balls in the box. 

This triangular recurrence is somewhat similar to that of the Stirling 
numbers of the second kind. To grasp this connection better, set 

p(k,n) = d(n,fc)~~. 

Then the numbers d(n, k) satisfy the same recurrence as the numbers 
S(n, k), and fulfill the same initial conditions. Therefore, d(n, k)= S(n, k), 
and thus p(k, n) = S(n, k) ^*• 
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29. This is a simple application of Corollary 3.48. We take a partition of [n] 
into k parts, but we do not put any structure on any of the blocks. The 
only requirement is that the blocks are not empty. Therefore, E(u)= e“- 
1 for all i F [/;]. Finally, unlike in Corollary 3.48, our partitions are 
unordered, so their number is only Hk\ times what it would be if they 
were ordered. This yields 

9k(u) = £ >S '( n > fc ):^[ ~ 1 ^‘ 

n~k 


31. By repeated applications of Proposition 3.12, we have 


c(p, k) = 


£ 


F- 


01+2024- \-pa p =p 


Co! • • • a p !l ai 2 02 • • -p a p 


ai!a 2 ! 


As p is a prime, no positive integer smaller than p is divisible by p. 
Therefore, the denominator of no summand on the right-hand side is 
divisible by p. Indeed, we must always have «,=(), otherwise we would 
have a permutation with one cycle only, and that is not allowed. As p\ is 
divisible by p, the right-hand side is the sum of several integers, each of 
which is divisible by^. 

33. Let y?>K>l, and let the matrices .s and S be defined as in Theorem 3.30. 
As s-S=I, we have 

p m 

0 = £ S (P, k)S(k, n) = ^2 s(p,k)S(k,n). 

k=l k=n 

We know from Exercises 31 and 32 that s(p, k) is divisible by p unless 
k=\ or k=p. Therefore, the only member of the far-right-hand side in 
which the s(p, k) term is not divisible by p is s(p, p) S(p, n)=S(p, n). As the 
right-hand side is divisible by p, so too must be S(p, n). 

35. As ri> 1, we can divide (3.3) by x to get 

n 

(» + 1) ■ • • (a; T- n — 1) = c(n, k)x k ~ l . 

k =1 


Now equate the coefficients of W on the two sides. 

37. Let A, be the set of w-permutations in which i is a fixed point. Then 
Ajj n ••• n Ai k = (n — k )\, and the result follows by the Principle of 
Inclusion-Exclusion. 

39. One of these sets is always one larger than the other. For odd n, the set 
of B-pcrmu tations with exactly one fixed point is larger. For even n, the 
set of derangements of length n is larger. 
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To see this, let G(n) be the number of n-permutations with exactly one 
fixed point, and note that G(n)=nD{n -\) holds for all n>3. Indeed, to find 
a permutation counted by G(n), first choose its only fixed point, then 
choose a derangement on the remaining n -1 entries. On the other hand, 
it follows from Corollary 3.57 that D(n)=nD(n- 1)+(T)”. This proves our 
claim. 

41. The notion of desarrangements, and the proofs below, are due to J. 
Desarmenien [70]. 

(a) Let p be a desarrangement of length n. Delete its last entry, and 
relabel the remaining entries accordingly. The obtained 
permutation p 'is always a desarrangement of length n-1 , except 
when/>=zz(zzT)--21 and n is even. Conversely, each/dis obtained 
from n different n-desarrangements p this way, except for p =(w- 
l)(w-2)---21 when n is odd. The latter is only obtained from n -1 
desarrangements of length n as 1 could not be the last entry. 

(b) A routine computation shows that the numbers D(n) as given by 
Corollary 3.57 satisfy the same recurrence relation as the 
numbers J(n) has been shown to satisfy in part [(a)] of this 
exercise. As J f (l)=D(l)=0, the proof follows. 

43. The entry 1 of a derangement^ of length n can be part of a 2-cycle or a 
larger cycle. There are (n- 1) other elements it can form a 2-cycle with, 
and then the remaining n -2 elements can form a derangement in D(n- 2) 
ways. On the other hand, if 1 is not to be in a 2-cycle, then we can just 
insert it in any gap position of any (zz-1)-derangement (taken on the set 
(2, 3, 4,-", n }) except the one that would put this entry into its own 1- 
cycle. This provides (n- 1) D(n- 1) additional derangements. 

45. We will repeatedly use the triangular recursive relation c(m, k)=c(m- 1, k- 

1) +(m-l) c(m-l, k). Subtract n+k-l times the next-to-last row from the 
last row. Then the ith element of the last row becomes c(n+k, i)-(n+k- 
1 )c(n+k-\, i)=c(n+k-l, z'-l). Now subtract n+k-2 times row (k-2) from row 
k-1. This results in a row k-1 whose ith element is c(n+k- 1, i)-(n+k- 

2) c{n+k-2 , i)=c(n+k-2, z-1). Continue this way for all rows. We get the 
matrix 


/ c(n + 1,1) 
c(n + 1,0) 


c(n + 1,2) 
c(n 4- 1,1) 


c(n + l,fc) N 
c(n + 1, k — 1) 


\c(n + fe — 1,0) c(ti + k — 1,1) • • - c(n + k - 1, k - 1) J 


that is, a matrix whose first row is identical to that of C n , but in which the 
/th element of row z" is c(n+i-l,j-l), for z>l. Expanding this matrix with 
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respect to the first column and using induction (the (n-1 ) x (n-1) minor in 
the lower right corner is C H ), we get our claim. 

47. List the elements of [2n] in any of (2w)! ways, then insert bars after 
every two elements. This will result in a fixed point-free involution. On 
the other hand, we obtain each such involution n\-2" times as we can 
change the order of the elements within each pair, and we can change 
the order of the pairs without changing the involution itself. 

49. In such a permutation, each cycle length must be either one or three. 
Therefore, by Theorem 3.53, we get 


G n {x) = exp 



EjrE 

i>0 ' j>0 


w 


= E 


[»/3] 

E 


n\ “ 3 j j\(n - 3/)!, 


showing that g n (C) = 3 ijUn- 3 j)i . The reader is urged to find a 

combinatorial explanation for this formula. 

51. By repeated applications of the method seen in Example 3.64, we get 


G(x ) = cosh (a;) exp ( — ) cosh ( — ] exp ( — 


00 x 2n 


= ex p(E^)n- h 


i> 1 


X 


2i—i 


2 < — 1 


= Vrr?n™ h 

i> 1 


v.2i—1 


2 i - 1 


53. (This solution is due to Dennis White [30].) Let p £ ODD{2rn). Denote 
by Ci, Q,—, C 2t the cycles of p in canonical order. We construct a bijection 
$ : () D D{'2rn) — )■ EVEN (2m) as follows. For all i, 1 <i<k, take the last 
element of C 2f i, and put it at the end of C 2 , to get <D (p). For example, if 
p=( 4) (513) (726) (8), then O(^=(5134)(72)(86). Note that if is a 1- 
cycle, it disappears, and that the canonical cycle structure ol fi is maintained. 

To see that ti> is a bijection, it suffices to show that for all a g S n we can 
recover the only permutation p g ODD(2m) for which (&(p)=a. While 
recovering p, we must keep in mind that it might have more than h 
cycles, because some of its 1-cycles might have been absorbed by the 
cycles immediately after them. If the last value in c h is larger than the 
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first value in C H , then create a 1-cycle with this value, placing it in front 
of c h and repeat the whole procedure using and C H . Otherwise, move 
this value from c h to the end of c, A and repeat the whole procedure using 
ria and C h . 2 . If at any point only one cycle remains, create a 1-cycle with 
the last value in that cycle. It is then straightforward to check that the 
permutation p obtained this way fulfils 0(^)=cr. It also follows from the 
simple structure of <D that at no point of the recovering procedure could 
we have done anything else. 

55. By Theorem 3.53, the exponential generating function of these 
permutations is given by 

G g (x) = exp ( ~ 

I , n 

\n^rkr 



The argument of exp on the right-hand side has to be computed a litde 
bit differently from the special case of k= 2, which was covered in Corollary 
3.59. 

We have 



n^kr 



n>l n 


Therefore, 

G c (x) = exp i in(l - x) _1 - ^ ln(l - a;*) -1 ] = ^ ^ ^ —. 
Now note that 1 -:A=(Tjc) (l+x+^H—l-W), from which we get that 


G c (x)=( l+x-tWH—l-W) (3) 

Assume for shortness that n=mk. Applying the binomial theorem, we get 


(1 — = ]T(-1) 


n^km l k 


m> 0 


1 -k 
k 

m 


Y ( i) m x k ™ 1 1 ~ k l ~ 2k 

Tfl. fa 

m> 0 


= E 


.km 


k k 

1 k - 1 2k — 1 (m - 1 )k — 1 


771 >0 


ml 


k 


where n=mk. 
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So the coefficient of 2— = (f^yr in (1 — x *)* 1 k ^ k is 

(kn)l(k — 1 ) ( 2 A; — 1 ) • • • ((m - l)fe — 1 ) 
a ” rr\k m 

= 1 -•■■(*- 2 ){k - 1 ) 2 (k +1) ■ ■ • (2 k- 1) 2 (2 k + 1) • • • (n - 1). 

It follows immediately from (3) that a n is also the coefficient of a n /n\ in 
G c (x), in other words, a=gc(n). Finally, if n is not divisible by k, then the 
only difference is that the long product in our last formula has last term 
n, not re- 1 . 

57. Take a pair ( 71 -, k) € ODD(2m + 1) x [2m +1], and insert 2m+2 into 
the Mi gap position. Note that this implies 2m+2 cannot create a 
singleton cycle as it cannot go into the last gap position. Take away 
the cycle C containing 2m+2, and run <t> of the solution of Exercise 
53 through the remaining cycles. Then, together with C, we have a 
permutation in EV EN{2m). Run it through 4 ) 1 to get 
t(tt, k) € ODD(2m + 2). 

We claim that r is indeed a bijection. To get the unique preimage of 
7 r' € ODD(2rn + 2) under r, run n’ first through 4>. This way 2m+2 
gets into the first position of an even cycle, and therefore it indicates a 
gap position, which is not the last one, and thus we recover k. Remove 
2m+2, leave its cycle intact, and run the remaining even cycles through 
4 ) 1 to get r _ 1 ( 7 r')- 

59. This argument is due to Dennis White [30]. In this solution, we are using 
the bijection vji : ODD(2n)x[2n+l]—>ODD(2n+l) of the solution of 
Exercise 56. If the reader has not solved that exercise yet, he is urged to 
do so now. As a hint, he may use the solution of Exercise 57. The maps 
4> used in the solution of Exercise 56 and 57 are very similar. 

We are going to construct a bijection k from iSQ[2re)x[2re+l] onto 
5Q(2w+l). As the growth of |5Q{reJ| is equal to that of \ODD(n)\ when 
passing from 2 n to 2 w+l, we try to integrate the bijection v[i : 
ODD(2n)x[2n+l]—»ODD(2n+l) into k, by “stretching” the odd cycles 
part of our permutations. We proceed as follows. 

Let (- 7 r, k) € SQ(2n) x [2rz + 1]. Take n, and break it into even cycles 
part and odd cycles part, or, for short, odd part and even part. Let k 
mark a gap position in n. If this gap position is in the odd cycles part, or 
at the end of 7 1 , then interpret the gap position as a gap position for the 
odd part only, and simply run the odd part and this gap position through 
i]j to get k(n), together with the unchanged even parts. Note that 2re+l 
will appear in an odd cycle when we are done. 

If the gap position marked by li is in one of the even cycles, say c, we can 
think of it as marking the member of c immediately following it, say x. 
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Replace x by 2w+l in c. To keep the information encoded by x, we 
interpret x as a gap position in the odd part of n. Indeed, if x is larger 
than exactly i-l entries in the odd part, then let us mark the ith gap 
position in the odd cycles part. So now we are in a situation like in the 
previous case, that is, the gap position is in the odd part. 

Run the odd part and this gap position through v|i- Instead of inserting 
2w+l to the marked position, however, insert temporarily a symbol B, 
to denote a number larger than all entries in the odd part. Then decrement 
all entries in the odd part that are larger than x (including B) by one 
notch. The obtained odd cycles and the unchanged even cycles (except 
for the mentioned change in c) give us k(k). Note that 2w+l will be in an 
even cycle when we are done. 

Now we show that k is indeed a bijection. First, it is clear that k maps 
into 5Q(2w+l). Indeed, (n) and k(k) have the same number of cycles of 
each even length, so tt G SQ(2n ) implies K (n) G SQ{2n + 1). 

For example, let tz=(31) (65) (742) (8 ) and let k= 3. Then k marks the 
entry 6. So we replace 6 by 9, get the new even part (31) (95), and turn 
to the odd part, (742) (8). In it, the entry 6 marks the third gap position 
as it is larger than two entries, 2 and 4. So we have to apply to ((742) 
(8), 3 ). When we do that, first we get 4>((7 4 2)(8))= then we insert B 
into the third gap position to get (74) (B82). Now we decrease the entries 
larger than 6 by one notch: B to 8, 8 to 7, 7 to 6, to get (64) (872). 
Finally, we apply O' 1 to (64 ) and complete the of k( n), that is (4) (6) 
(872). With the previously obtained even part, this yields that k(ti)=( 
31) (4) (6) (872) (95). 

To get the reverse of k, take a permutation t;' G SQ(2n + 1), and locate 
2n+l. If it is in an odd cycle, then run the odd cycles through i|i '. This 
will yield an odd part one shorter, and an element of [2w+l]. Putting 
this together with the unchanged even part, we get ic 1 (n). 

If 2w+l is in an even cycle, then run the odd part through i|tThis will 
specify a gap position in the odd part, and so we recover the entry x. We 
point out that the only way we can get the given odd part of k(k) by is by this: x had 
to be the marked entry of the even part of n. Increment entries larger than x by 
one notch in the odd part. To get the even part, put x back to the place 
of 2w+l. The gap position immediately preceding 2w+l is our k in 
k 1 (k). 

61. The trials of Exercise 6 of Chapter 1 are not independent in the sense 
that Levy’s theorem requires them to be. That is, for Levy’s theorem to 
be applicable, we have to define what a success is in these trials. Clearly, 
a success has to be defined as the event that the ball currendy placed 
goes into a box that was previously empty (this is how the numbers of 
empty boxes will equal the Eulerian numbers). However, with that 
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definition, the trials are not independent as the probability of success on 
trial i does depend on the number of successes on the previous trials if 
i> 3. Therefore, Levy’s theorem does not apply, except when n<2. 


Solutions for Chapter 4 

1. Recall the notion of rank from Theorem 4.10. Generalize the 
SimionSchmidt bijection of Lemma 4.3 as follows. Instead of fixing just 
the left-to-right minima, fix all entries that are of rank at most k-2. Then 
proceed like in the Simion-Schmidt bijection. 

3. Let p be a permutation enumerated by <S„(132, 312)=a„. Let us say that 
n is in position i. Then all entries preceding n are larger than all entries 
that n precedes. Moreover, entries that n precedes are in decreasing 
order. This proves the recursive formula a n = V’”. 1 o,_i for n>2. with 
«i=l. Solving this recursion, we get a=2"- 1 . 

5. Up avoids 132, then all entries on the left of n are larger than all entries 
on the right of n. Furthermore, the subword on the left of n has to avoid 
123 as well, while the subword on the right of n has to avoid 132 and 
1234. These conditions together are sufficient. 

The reader is invited to prove that 5„(123,132)=2“' 1 . Now let b„=S„( 132, 
1234). Let now n be in position i of our permutation. Then it follows 
from the above that there are 2 l ' 2 b rtl ways for p to be (132, 1234)- 
avoiding if z>l, and b*.\ ways if f=l. This implies 

n 

b n — b n -i + 2 * “b n -i. 

i=2 

Solving this recursion, we get that b=F 2n , where F { is the zth Fibonacci 
number, that is, iq=0, Fj=l, and then F i+1 =Fp-F i . 1 . This result was 
proved in [198] by a different argument. We point out that [198] has a 
catalogue of results for sequences Sjp, q), where p is of length three, and 
q is of length four, as well as a general proof technique to obtain those 
results. 

7. We show that for ?z>3, we have g(n)-g(n-l)=n-2’ t3 . Indeed, there are g(n- 1) 
permutations that avoid both 132 and 4231 in which n is in the last 
position. If n is not in the last position, then each entry of L is larger than 
each entry of R. Moreover, L is a permutation that avoids 132 and 312, 
and R is a permutation that avoids 132 and 231. These conditions are 
sufficient. It then follows from Exercise 3 that the number of such 
permutations for each position of n is equal to 2 n ' 2 if n is not in the first 
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position, and to 2”' 1 if n is in the first position, proving our recursive 
formula. The statement of the exercise is then proved by induction. 

9. (a) The numbers r„4=5„(3142, 2413) are the famous large Schroder 

numbers. See [197] for a proof of the recursive formula 



It is easy to see from the above formula that the number r n also 
counts subdiagonal lattice paths from (0, 0) to (n, n) that use steps 
(0, 1), (1, 0), and (1, 1). This proves the recursive formula. 

n 

X n — r n —1 + ^ T j — D n—j • (4) 

i= 1 

From (4) we see that 

„ l — x — s/l — 6x + x 2 
> . r n x = -—-■ 


(b) There are nine other pairs (p, q) that are not trivially equivalent to 
(3142, 2413), but are still enumerated by the same number of 
^permutations. One of them is (1324, 1423 ) as was proved in 
[197] Another pair was found by Gire in [110]. Finally, it was 
D.Kremer who gave an exhaustive list of all ten pairs in [138]. 
The remaining eight pairs are (1234, 2134), (1342, 2341), (3124, 
3214), 5(3142, 3241), (3412, 3421), 5(2134, 1324), (3124, 2314), 
and finally (2134, 3124). We point out that the Schroder numbers 
many other contexts, and the interested reader should consult 
Chapter 6 of [180] for details. 

11. (a) This result was proved in [35]. Denote by/i(n)=5„(1324, 2413) the 
number of the permutations to enumerate, for shortness. It is 
obvious that if n is the leftmost entry, then the number of such 
permutations is h(n-l). Now let p be a (1324, 2413)-avoiding 
^permutation; suppose n is not the leftmost entry of p and let a be 
the smallest entry of p that precedes n. Then n precedes the entries 
1, 2, •••, a- 1. Furthermore, these a -1 entries must occupy the last a- 
1 positions (why?). 

So the last a -1 entries of p are the smallest ones, and so we can 
have A(fl-l) different strings on them. Let t(n-a+ 1) be the number 
of possible substrings on the first n-a+1 entries, in other words, 
t(i) is the number of (1324,2413)-avoiding w-permutations in which 
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n 



1 

FIGURE 9.14 

The strong block subdivision of p. 

the entry 1 precedes the entry n. In what follows, we are going to 
use this second interpretation of t(n) so as to alleviate notation. Set 
^(0)=0. Let T(x) = £i>i t(n)x n .. It follows from the above that 
permutations counted by t(n) are precisely the indecomposable 
(1324,2413)-avoiding w-permutations. It is then clear that H(x)= 1/ 
(1 -T(x)), and this includes even the case when n is the leftmost 
entry. Now we analyze the structure of permutations enumerated 
by the t(i) in order to determine E(x). 

Call entries before the entry 1 front entries, entries after the entry 
n back entries, and entries between 1 and n middle entries. Say that 
an entry x separates two entries y and z written in increasing order if 
jKxK z. 

The front entries must form a 132-avoiding permutation, the 
middle entries must form an increasing subsequence, and the back 
entries must form a 213-avoiding permutation. Similary, no front 
entry can separate two middle entries, or two back entries in 
increasing order; no middle entry can separate two front entries 
in increasing order or two back entries in increasing order; and no 
back entry can separate two middle entries or two front entries in 
increasing order. 

Therefore, the only way for two entries of the same category to 
be in increasing order is when they relate to any entries of the 
other two category the same way. Such entries are said to form a 
strong block. The strong block subdivision of a permutation counted 
by T(x) is shown in Figure 9.14. 

As we said, each strong block between 1 and n consists of an in¬ 
creasing subsequence, while strong blocks in the front are 
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132-avoiding permutations, and strong blocks in the back are 231- 
avoiding permutations. Permutations satisfying all these conditions 
do avoid both 1324 and 2413. 

Now for i >2 let Vu be the number of those permutations counted 
by t(i) containing no middle strong blocks, except for 1 and i. So 
£ 4 = 1 , £> 2 = 2 , £ 4 = 6 , Let R(x) = i t be the generating 
function for the v r Then clearly T(x ) = -• 

Note that v lA is just the number of ways to partition the interval 
{2, 3, ■■■, z-1] into disjoint intervals, and then taking a 213-avoiding 
or a 132-avoiding permutation on each of them alternatingly. It is 
not hard to see by a lattice path argument that this means that 
Vi = [ 2 *I?) so R{x) = v 7 r L fe. Therefore, 

x £\Cl — 4a; 

1 — R(x) fl — ix — x ’ 


which implies 


H(x) 


1 

1 - T(x) 


(Vl — 4a; — a;) (v/l — 4x(l — a;) + x) 
(1 — 4x)(l — x ) 2 — x 2 
1 — 5x + 3x 2 + x 2 s/l~- 4x 
1 — 6 x + 8 x 2 — 4x 3 


(b) There are five nontrivially equivalent pairs (p, q) so that S„(p, q)= 
h(n) for all n. Of the remaining four, the pair (1324, 2143 ) was 
found in [122], the pair (3214, 4123 ) was found in [174], and the 
pairs (1342, 2314) and (1342, 3241) were found in [35] 


13. This problem has been solved in [121], where the authors showed that 


^/ n (2143)x n 

n>0 


1 - x - vT~- 2x — 3a ; 2 
2x 


( 5 ) 


The numbers /„(2143) are called the Motzkin numbers, and occur in 
numerous combinatorial problems. See [180] for an extensive list of 
these problems. 

The authors of [121] showed the above formula by finding a bijection 
between these involutions, and 1-2 trees. A 1-2 tree is a plane tree in 
which each vertex has 0, 1, or 2 children. However, in contrast to 
decreasing binary trees, if a vertex has a single child, that child is direcdy 
below its parent, not on its left or right. See Figure 9.15 for an example. 

It is not difficult to see that the numbers of these trees satisfy the 
recurrence M n = M n ^i + YX=t M kM n ~ 2 -fr from which (5) follows 
for /„(2143)=M„. 
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FIGURE 9.15 

A 1-2 tree. 


15. As a 132-avoiding permutation is indecomposable if and only if it ends 
with its largest entry, the number of such ^-permutations is C k4 . Let our 
three blocks end in positions i, j, and n. With these restrictions, we 
clearly have C rl4 C„ :;4 permutations with the desired property. Now we 
have to sum this expression for all possible i and j to get the total 
number of 132-avoiding w-permutations having exacdy three blocks. 
We do this in two steps. First, fix i, and compute the sum 


n 1 



)=i +1 


Second, we sum for all possible i to get that there exist 


n —2 



i=l 


permutations with the desired property, as long as ?z>3. 

17. Let p be a 231-avoiding n-permutation. Let us call p dually decomposable if 
it can be cut into two parts so that each entry before the cut is smaller 
than each entry after the cut. For example, 21543 is dually decomposable. 
If p is dually decomposable, then say that its first cut is after i entries 
(l<i<n-l), and that p=LR, where L is the substring of the first i entries. 
Then we define the northeastern lattice path f(p) recursively, by taking 
the path f(L) from (0, 0) to (i, i), and continuing it with a translated copy 
of the path f(R) from (i, i) to (n, n). 

We still have to define/for 231-avoiding permutations that are not dually 
decomposable. These permutations start in their entry n (otherwise they 
would have a cut immediately before the entry n). Let p be such a 
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permutation, and let p’ be the (w-l)-permutation obtained from p by 
omitting the entry n. Then define f(p) as the concatenation of the step (0, 
0 ) to (1, 0), a translated copy of f(p) from (1, 0) to ( n, n-1 ), and the step 
( n, n-1 ) from (n, n). 

It is straightforward to prove again by induction that this recursively 
defined map /"is indeed a bijection. By induction again, the number of 
ascents of/> is equal to the number of north-to-east turns of f(p). 

19. Yes. If/? is increasing and q is decreasing, then all permutations of length 
at least |/>|-|<y|+l must contain at least one of p and q. This is a famous 
result of Erdos and Szekeres, and we proved it in Proposition 6.36. 

21. As 231 and 312 are inverses of each other, and the inverse of an even 
permutation is even, the first equality is straightforward. 

A permutation is 231-avoiding if and only if its reverse is 132-avoiding. 
On the other hand, reversing a permutation is the same as multiplying 
it with the transpositions (In), (2w-l), •••. The number of these 
transpositions is [nJ2\, and our proof follows from the result of the 
previous exercise. This result first appeared in [171], in a slighdy different 
form. 


23. We claim that for ri> 1, S, ,(123,132, 2l?>)=F„ +] , where the F„ are the well- 
known Fibonacci numbers, starting with F 0 = 0, and Ff= 1, and then given 
by F n+1 =F„+F„_ t . This result was first mentioned in [171]. Indeed, in a 
permutation p enumerated by 6’„(123, 132, 213), the entry n must be in 
either the first or second position, otherwise p could not avoid both of 
123 and 213. If n is in the first place, then, by induction, we have F„ 
possibilities for the rest of p. If n is in the second place, then n -1 must be 
in the first place, otherwise a 231-pattern is formed. Then, by induction 
again, we have F rA possibilities for the rest ofp. This shows that *S„(123, 
132,213)=A„ +1 . It is well-known, and can be proved by routine generating 
function techniques, that 


F n = 


1 

7 ! 




n+1 


i 

7 ! 


'l - v/5 N 


n+1 


< 1, the first term is dominant, showing that we have 

ys„(123,213,132) = (+^). 

25. The pattern 132456---A’ can play the role of q. The proof is similar to 
that of Theorem 4.18. However, instead of simply defining a strong 
class by its left-to-right minima and right-to-left maxima, we also have to 
fix the value of the position of entries that become right-left maxima if 
the original right-to-left maxima are removed. We then have to iterate 
this procedure n-4 more times. 


A*¥ 
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27. (a) Let p be a 132-avoiding w-permutation in which the entry n is in 
position i. Then the binary plane tree 'f(p) will have a left subtree 
of i -1 nodes and a right subtree of n-i nodes. The rest of the tree 
is constructed recursively by the same rule. The map 7 is a Injection 
as the position of n m.p can be read off T(p) as the size of the left 
subtree of T(p) plus one. The position of the other vertices can 
then be found recursively, noting that if n is in position z, then the 
set of entries of the left of n must be {z'+l, z+2, •••, w}, and the set 
of entries on the right of n must be [z-1]. It is here that we use the 
fact that p is 132-avoiding. 

Note that while T(p) is an unlabeled tree, each node of T(p) is 
naturally associated to an entry of p. Nevertheless, T(p) is unlabeled 
as writing these entries to the nodes would not carry any additional 
information. 

(b) It is straightforward to prove by induction that d(p) is the number 
of right edges (that is, edges that go down and right, or, if you like, 
from northwest to southeast). 

(c) We claim that pi>p,+i if and only if the vertex corresponding to P, 
appears on a higher level in T(p) than the vertex corresponding to 
P i+ 1 - This is obvious if n= 2. Now assume our claim is true for all 
integers less than n. Then if/;, and P i+i are on the same side of the 
entry n, then the corresponding vertices are in the same (left or 
right) subtree of ' T(p), and our claim follows by induction. 
Consecutive entries cannot be on two different sides of n, so the 
only remaining case is when one of p, and p i+l is equal to n. That 
case is trivial, however, as that vertex will correspond to the root 
itself. 

29. The number of 1234-avoiding w-pcrmutaiions is equal to the number of 
strong classes of w-permutations. On the other hand, the number of 
1324-avoiding w-permutations is asymptotically more than that. Indeed, 
for ?i>7, it is very easy to construct strong classes that end in the class 
3*1 *7*5 by concatenation. These classes contain at least two 1324- 
avoiding permutations. On the other hand, they constitute at least a 
constant factor of all strong classes, implying our claim. 

31. Let p be a pattern of length k starting with 1 and ending with k so that 
Sntp^C 1 holds for all n, for some constant C. Then we claim that 
5w(^j)<(4cC)”, thus we can set K=AcC. Take an n-permutation ji which 
avoids q. Suppose it contains q (this will only exclude c” permutations). 
Then consider all copies of q in our permutation and consider their 
entries x. Color these entries red. Clearly, the red entries must form a 
permutation which does not contain/;. For suppose they do, and denote 
Xi and x t the first and last elements of that purported copy of p. Then the 
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initial segment of the copy of q which contains x, and the ending 
segment of the copy of q which contains x k would form a copy of q\ 

Now remove all the red entries from n, to get n Then n ’ must be q- 
avoiding as all copies of q in n lost their entries playing the role of x. 
Therefore, there are at most C possibilities for the permutation of the 
non-red entries. There are at most 2"~' choices for the positions of the 
red entries, at most 2' h] choices for the values of the red entries, and at 
most C"' 1 choices for the permutation on the red entries. This shows that 
less than (4Q■ ("+ c”<(4Cc)” permutations of length n can avoid q’. 

33. It suffices to prove that every ^-avoiding w-permutation p can be 
extended into a ^-avoiding (n +l)-permutation by prepending it with a 
new first entry, in k-1 different ways. Let i be the first entry of q. Then 
we can prepend p with any one of the entries 1, 2, ■■■, z-1, as well as any 
one of the entries n, n- 1, •••, n-k+i+1 without creating a copy of q. 
Indeed, the first entry of the obtained (w+1)-permutation would be 
either too small or too large to play the role of i in any copy of q, and, 
being the leftmost entry, it certainly cannot play the role of any other 
entry of q. 

35. These will be the trees in which going from the leaves up, each label is as 
large as it can possibly be, that is, the sum of its children plus 1. Indeed, 
in 123-avoiding permutations, the entries that are not left-toright 
minima form a decreasing sequence, meaning that each vertex will 
contribute to its own label. 

Note that this observation provides a bijection from the set of 
indecomposable 123-avoiding w-permutations to that of unlabeled plane 
trees on n vertices. 

37. The number of inversions is translated to an area. More precisely, let L 
be/(123"-w), the staircase lattice path. Cttf(p)=L’[or some 231avoiding 
n-permutation p. Then i(p) is equal to the area between L and L\ See 
Figure 9.16 for an example. This can be proved by induction, using the 
dual block decomposition of p explained in the solution of Exercise 17. 

We mention that the area statistic of these permutations leads to an 
interesting open problem. Let n be fixed, and let a k be the number of 
231-avoiding n-permutations p so that the area belo wf(p) is equal to k. 
Then it is conjectured in [183] that the sequence [a k ] is unimodal. It is 
also conjecture in [183] that unimodality remains true if the staircase 
Ferrers shape that f(p) is not allowed to enter is replaced by another self¬ 
conjugate Ferrers shape. 

39. Yes. We claim that for k> 2, we have »S„(123, (k-l)k---21 )<w 2(M) We prove 
this claim by induction on k. For ^=2, the statement is trivial, and for 
k= 3, the statement is true as S n { 123,231) = (”) + 1 < n 2 - 
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FIGURE 9.16 

The area between L and Z’=^/(53124) is equal to z(53124)=6. 


Now assume the statement is true for k, and prove it for k+ 1. It is 
clear that if a permutation p avoids both 123 and (k'T ) /<• --21, the 
substring jT obtained from ^ by omitting its right-to-left minima must 
avoid both 123 and (fc-2) ( kT ) • • -2 1. Note that/; can have at most two 
right-to-left minima as p is the union of two decreasing sequences. One 
of these two right-to-left minima must be the entry 1, and the other is 
the rightmost entry. We have at most n choices for the position of n, at 
most n choices for the rightmost entry, and, by induction, at most n 2t - k " 
3) choices for p’. Therefore, we have at most w-w-w 2(H) =w 2(4 ' 2> choices for 
p as claimed. 

41. It is proved in [2], along with many similar results on multiset 
permutations avoiding patterns of length three, that 


B \ (01,02,03) — 


«1 r «2 t j 
03 




«3 


02 


Solutions for Chapter 5 

1. We claim that Si 32 ,i(n) = See [33] for a proof. The main idea 

is the following. In a permutation enumerated by <S 132il (n), there is either 
one or no front entry that is smaller than a back entry. If there is one 
such front entry, then its position and size is very restricted. If there is 
no such front entry, then the single 132-pattern of the permutation is 
formed either by front entries only, or by back entries only. This leads 
to a recursive formula involving the numbers <S 132 ,i (n) and the Catalan 
numbers, and solving that recursion, we obtain the above explicit 
formula. 
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This classic problem was first solved by Cayley [58], who proved that 


f(n,d ) 


1 //!> I (I I 2\ /n 1\ 

n + d + 2 y d 4- 1 ) \ d J 


Recendy, Richard Stanley [182] gave a proof based on a bijection between 
these polygon dissections and Standard Young Tableaux. We will revisit 
that proof in Exercise 22 of Chapter 7. 

5. We prove the statement by induction on k. For k=l, we have S(n, 1)= 1, 
which is obviously /'’-recursive. Now assume the statement is true for k- 
1, and prove it for k. We have seen in Exercise 8 of Chapter 1 that 

S(n, k)-kS(n- 1, k)=S(n -\, k-\). 

Let Sfx) — J2 n >k S(n,k)x n . Multiplying both sides of the above 
equation by xf, and summing for all n=k, we get 

Sk{x){l - kx) ~ xS k -i)(x) 


Sk(x) 


xS k -i(x) 
1 - kx 


so Sfx) is the product of the d-finite generating functions xS k fx) and 
and is therefore d-finite. 

Note that it follows from the above that 

^ k ^ X ' (1 — x )(l — 2x) - ■ • (1 — kx)' 

7. Let A(x) be algebraic of degree a, and let B(x) be algebraic of degree b. 
Then A (x)" can be obtained as a linear combination of the power series 
1, A(x), A(x) 2 , A(x) al in which the coefficients are polynomials. The 
analogous statement holds for B(x) h . That means that any expression 
involving A(x) and B(x) and algebraic operations on them can be obtained 
using linear combinations of the A(x)‘ and B(x} with iAa-l and j<b-l, and 
the same algebraic operations that are used in the desired expression. 

9. On one hand ,f”(x)=f(x), so the dimension of the vector space spanned 
by the derivatives of/tis at most of dimension two, thereforey'is d-finite. 
On the other hand, assume that f is algebraic of degree d, that is, 

P 0 (x)+P 1 (x) sin(x)+ ••• +P d (x) sin rf (x)=0, 

with d being minimal. As Pfx) sin(ic]H- \-P i (x)s\n i (x)=Q infinitely many 

values of x, it follows that P 0 (x) must be the zero polynomial. So the 
above equation reduces to 

sin (x) (PI fxj-t -h P d sin'* 1 (x)=0 


2004 by Chapman & Hall/CRC 






Solutions to Odd-numbered Exercises. 


349 


As sinfxj is not identically zero, it follows that Pi (x)-\ -l-P^suP 1 (x)= 0 as 

a function, contradicting the assumption that the degree of sin(x) was d. 

Note that this proof depended on the fact that sin(x) has infinitely many 
roots. See [180], Chapter 6, for two proofs of the fact that <f is not 
algebraic. On the whole, however, it is often difficult to prove that a 
series is not algebraic. 

11. It is proved in Exercise 11 of Chapter 4 that 


1 — 5ar + 3a; 2 + t?sJ\ — 4a; 
1 — 6x + 8a; 2 — 4x 3 


H{x) = ]T S n (1342,2431)x" 


n>0 


In other words, the generating function H(x) of our sequence is algebraic. 
Indeed, if a(x)= 1-5X+3T and b(x)=\Sxr\-^-^ , then (b(x)H(x)-a(x)) 2 =x i (l- 
4x), and the algebraic property of H(x) follows. Therefore, H(x) is d- 
finite, proving that our sequence is P-recursive. 

13. A permutation is layered if and only if it avoids both 312 and 231. On 
one hand, it is straightforward to see that the condition is necessary. To 
see that the condition is sufficient, note that in permutations avoiding 
312, the entries after the maximal entry have to be in decreasing order, 
and in permutations avoiding 231, the entries after the maximal entry 
have to be larger than entries before the maximal entry. 

15. Induct on n. The claim is clearly true if n—1. Assuming it is true for n- 1, 
first consider c < ( n I 1 ) and let p g S n -i satisfy the lemma. Then 
pn (-_ ,S n works for such c. On the other hand, if J ) < c < (") then 
consider c' = c — (n — 1) < ("I 1 ). Pickp g S n -i with P copies of 21 
and none of 132. Then np £ S n is the desired permutation. 

17. A 321-avoiding w-permutation consists of two increasing subsequences, 
of sizes a and n-a. Therefore, the number of inversions in p is at most 
a(ji — a) < —■ This is indeed attainable, for permutations like 456123. 

19. (a) We claim that sp(3)=5. On one hand, 41352 is a 3-superpattern. 

On the other hand, there is no 3-superpattern of length four as a 
pattern of length four can contain at most four different patterns 
of length three. 

(b) The permutation 13610259487 is a 4-superp 

(c) Consider the grid shown in Figure 9.17. 

Then read the columns one after another starting with the leftmost 
one, going from the bottom up in each column. The obtained 
permutation is clearly a Csupcrpattcrn. Indeed, if we want to find 


2004 by Chapman & Hall/CRC 




350 


Combinatorics ofPermutations 


k-k+1 

k-k+2 


k 2 





k+1 

k+2 


2k 

1 

2 


k 


FIGURE 9.17 

A grid for a A-superpattern. 


a copy of q=q ■■■q, all we need to do is to choose the ryzth entry of 
the ith column for each i. 

21. (a) A permutation of length six has = 15 subwords of length four, 
so it cannot contain all 24 different patterns of length four. 

(b) A 4-superpattern has to contain both the increasing and the 
decreasing pattern of length four. A permutation p of length seven 
can only do that if it is the union of a 1234-pattern and a 
4321pattern, which intersect in exaedy one entry. If p has this 
decomposition property, then all of its subwords can be decomposed 
into an increasing and a decreasing sequence. Therefore, 3412 
cannot be a subword of p. 

23. For any k, the number of different /{-clement patterns contained in p is 
at most min (, k\) as there are (*) subwords of p that have k elements, 
and there are k\ different possibilities for the pattern of these subwords. 
One checks that < k\ if and only if Z=5, so we get that the number 
of different patterns in p is at most 1+1+2+6+24+56+28+8-1-1=127. 

25. No, this is not always true. As is shown in [158], the case of kF= 6 provides 
the smallest counterexample. In that case, the layered permutation having 
seven layers of length nJ7 each will have more copies of q than the 
permutation having six layers of length n!6 each. 

27. This result and its proof can be found in [3]. 

29. Set M 0 = 1, and at(*) = The number of such paths that first return 

to the line y= 0 at the point ( k, 0) is clearly M k2 M n . k if 2 <k<n, and M n . k if 
kF= 1. This leads to the functional equation 

M(x)=l +M(x)x+AP (x)£ 

hence 

, . 1 — x — \Ji — 2x — 3a: 2 
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In other words, M(x) is algebraic, and therefore, d-finite. 


Solutions for Chapter 6 

1. (a) The choice of i, j, k and yds clearly insignificant. Applying the 

Transition Lemma, we see that the four entries n- 3, n- 2, n- 1, and n 
are in the same cycle of/; if and only if n is the leftmost of the four 
of them in f(p). This obviously happens in 1/4 of all w-permutations, 
so the probability in question is 1/4. 

(b) The entries n- 3, w-2, n- 1, and n belong to different cycles o{p if and 
only if they are in increasing order in ftp). The latter happens in 1/24 
of all M-permutations, so the probability we are looking for is 1/24. 

3. We can again assume that the four entries in question are n- 3, n- 2, n- 1, 
and n. 


(a) This will happen if and only if the pattern formed by our four 
entries has two left-to-right maxima. The number of such 4- 
permutations is c(4, 2)=11, so the probability we are looking for is 
11/24. 

(b) Similarly, this will happen if the pattern of our four entries has 
three left-to-right maxima. The number of such permutations is 
e(4, 3)=6, so the probability in question is A. = A.. 

5. This exercise is very similar to Example 6.27. Let S be a ^-element 
subset of [n], and let X(p) be the number of A-cycles of p. Then the 
probability that the entries belonging to S form a A-cycle is (k-\)\. ( ra ~j fc ) ! , 
therefore this is the expectation of the corresponding indicator variable 
Xs. As there are such in dicator variables, we get that 

Em^ElXs) =(")•«=■= 


7. For obvious symmetry reasons, we have E(Z) = (”) /2. To compute 

E(J<?), introduce the indicator variables ^ defined for p= p\p 2 —p n and 
for all pairs of elements /</ by 



1 if Pi > Pj 
0 if not. 


It is then clear that Z = YLi K j %i,j. Furthermore, = E(Zf j) = 

1/2. There are several cases to consider when computing the expectations 
E(Z,jZk). The simplest is the case of the (”) ( n “ 2 ) /2 pairs when [i,j] and 
{k, 1} are disjoint. In that case, clearly E(Z,jZh ^=1/4. If i=k, butjW, then 


2004 by Chapman & Hall/CRC 





352 


Combinatorics of Permutations 


E(ZjZt,) = 1/ 3 as two of the six possible patterns for the triple Ppfii are 
favorable (the ones in which p, is the largest of the three entries). Similarly, 
we have E(ZjZ,) = 1/3 if/=/ but i^k. Each of these possibilities occure in 
2 (”) cases. Finally it can also happen that j=k, in which case E(ZjZ,i) = 1/ 
6 pattern Pppi has to be decreasing) or that i=l, in which case again, 
E(ZjZJ = 1/6 (the pattern pfpj has to be decreasing). Each of these 
scenarios occur ('!)) times. Therefore, we have 


E(Z 2 ) = J2 E ( z lj) + E 


i<j 






n — 2 
2 



and consequently, 

Var(Z) = E(Z 2 ) - E{Z) 2 

1 fn\ lfn\(n — 2\ 5 (n\ n 2 (n — l) 2 

~2\2/^4y2/\ 2 J + 3V3y 16 

/n\2n + 5 
= \2 ) 36 ‘ 

9. No, there is not. If there were, applying the hooklength formula for 
such a Ferrers shape, the numerator, being 20!, would be divisible only 
by 7 2 , while the denominator would contain three factors equal to 7. 

11. We prove that we can even create a tree that consists of a single path 
that satisfies at least one third of the constraints. 

These trees correspond to permutations of [n]. The generic constraint 
{fa, b), (c, d)} will be satisfied by eight of the possible 24 relative orders 
of these four elements as we can swap entries within the pairs, or we can 
swap the pairs. Let p be a random w-permutation, and let Xfp) be the 
indicator variable of the event that the tree defined by p satisfies the 
constraint X. Then E(X)=1I3, where the expectation is taken over all n- 
permutations p. Using the linearity of expectation for the indicator 
variables of all constraints, we obtain our claim. 

13. Let R be the range of X. By the definition of expectation, we have 
H = E(X) = E iP l X = i) = E iP ^ X ~ *1 + E iP ^ X = *1 

i£R i>afj, i<a/L4 

> iP[X — *] > a/x • P[X > an]. 

i>afx 

Dividing by aji, we get Markov’s inequality. 
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15. As p is an involution, its cycles are all of length one or length two. Let /„ 
denote the number of involutions of length n. Then the number of 
involutions of length n in which the entry 1 is fixed is Ai, whereas the 
number of involutions of length n in which the entry 1 is part of a 2- 
cycle is (mT)A 2 . This shows that 


E(Y) 


I n- 1 + 2(n — l)At-2 
in 


Finally, as it is easy to compute by the Exponential formula, or directly, 
In = £l=£ 2J (”)(2 i - 1)!!, where we set (-1)!! = 1. 

17. By symmetry, we have 


P 






i(p) 




We have seen in the solution of Exercise 7 that if Z(p)=i(p), then 
Var{Z) = ( 2 )^^- Therefore, a = sJV ar{Z) = 0(n 3 / 2 ), and 
our statement follows from Chebysev’s inequality by setting A = ca x/n'c 
for an appropriate constant c. 

19. If n is in a cycle of length more than 2, then deleting n, we get a 
derangement of length n- 1. If n is in a 2-cycle, then deleting its cycle, we 
get a derangement of length n- 2. This leads to the formula 


E(Y n ) = 


(n — 1 )D(n — 1) 
D(n ) 


E(Y n -1 + 1 ) + 


(n — l)D(n — 2) 

D(n) 


E(Y n _ 2 ). 


21. This fact was published in [63] without proof. Let rbe an w-permutation 
that consists of one cycle, and let £ be the indicator variable of the 
event that the vertices of Gp, q form a Hamiltonian cycle in the order 
given by r. Then 


k =0 x 7 
n + 1 
nl 


(n — k)l k\ 
nl nl 


n 


E 


1 

nl 


Indeed, first choose the k edges that will come from the permutation p, 
that specifies the values oip(i) for k distinct values of i. Then choose the 
remaining n-k values olp in (n-k)! ways. Similarly, the n-k edges of the 
Hamiltonian cycle r that come from q specify the values of q(i) for k 
distict values of i, then choose the remaining values in k! distinct ways. 
Using linearity of expectation, we get that 

T) - 4 - 1 

E(Z) = (n- l)lE(Z r ) = . 

n 
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23. The number of total cycles of p is equal to the number of its 1-cycles, plus 
the number of its 2-cycles, and so on. If Xk(p) denotes the number of A-cycles 
of p E S n and X(p) denotes the numbers of all cycles of p, then it follows 
from the result of Example 6.2 that 


E {X ) = ±E(X k ) = ± 1 -. 

k =1 fc=l n 

25. Our proof will be by induction on n. For w=l and re=2, the statement is 
vacuous, and it is straightforward to check that the statement is true for n= 3. 
Indeed, in the only nontrivial case, we get A l - k =A 2 k =ll2. Now assume that 
the statement is true for re-1, and prove it for n. Then in any SYT on A, the 
entry n is in one of the inner corners of A. The presence of A. does not have 
any influence on the occurrence of /l, A if z<m-1, so by the induction hypothesis 
(after removing the box containing re), we get that A lx =A 2 y=---=A n . 2 , x . So our 
claim will be proved if we show that A n . 2 ,\=A^. 

Let Xhc the set of SYT on X in which re-2 is a descent but n- 1 is not, and let 
The the set of SYT on X in which n- 1 is a descent but re-2 is not. It clearly 
suffices to show that X and T are equinumerous; we do so by presenting a 
bijection g: X—> Y. 

Let T £ X- We consider three separate cases. 

(a) When w-1 and n are in the same row, then we define g(T) by swapping re- 
1 and w-2 in T. Note that n-2 and n will be in the same row of g(l). 

(b) When re-1 is in a row that is stricdy below the row containing n, and w-2 
is in the same row as n, then define g(7) by swapping w-1 and re in T. 
Note that w-2 and w-1 will be in the same row o f g(T 

(c) In all other cases, w-2, w-1, and w are in all different rows. 

(cl) If w-2 is the north neighbor of w-1. In this case, swap w-1 and w to get 
g(T). So in g(T), the entry w-2 will be the north neighbor of w. 

(c2) If w-2 is the north neighbor of n, then swap w-1 and w again to get 
g(T). So w-2 is the north neighbor of w in g(V- 

(c3) If w-2 is an inner corner, then from top to bottom, our three 
maximal entries are either in order n, n-2 , w-1, or w- 2, n, w-1. We 
then obtain g(l) by ordering them, respectively, w-1, w-2, w, and w-1, 
n, n-2. Note that all three entries are in inner corners in g(T). 

Note that in all three subcases, the three maximal entries are in different 
rows of g(I). 
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This completely defines the map g: X—> T. To see that g is a bijection, we 
show that it has an inverse. Let U £ Y ■ By our remarks at the end of each 
case, we can establish (from the positions of the three maximal entries in U) 
which rule was used to create U, and our statement is proved. 

We mention that [180] contains a non-induction proof of the result we have 
just proved. 

27. Note that p has ri rising sequences if and only if p * 1 * 3 * 5 has rzT descents, or ri 
ascending runs. So replacing ri(p) by d(p )+1 in the result of the previous 
exercise, we get the probability that p ' is obtained by our shuffle. If we sum 
that equation over all p e S„ then we get 1 on the left-hand side. Multiplying 
both sides by a n , we get the identity of Theorem 1.8. 

29. As for any X C [n] all entries of X must have the same chance to be the 
index of the minimum element oi f(X). the size of F’must be divisible by the 
size of X, and this has to hold for all possible X. Therefore, |F| has to be 
divisible by the least common multiple of the numbers 1, 2, 3, —, n. This 
argument is from [51]. The authors then mention the well-known number 
theoretical fact [7] that this least common divisor is of size e"' 0 ^. 


Solutions for Chapter 7 

1. We know that permutations of length n that avoid 12 ■ ■ -k correspond to pairs of 
SYT of the same shape that have at most k-\ columns. Because of Theorem 
7.11, involutions of length n correspond to such pairs in which the two elements 
are identical. In other words, 

/ n (123---*) = £/ F , 

F 

where F runs through SYT on n boxes having at most k-1 columns. The 
proof is then immediate. 

3. These SYT bijectively correspond to northeastern lattice paths from (0, 0) 
to (n, n) that never go above the main diagonal. The bijection is given by 

reading the entries in an SYT in increasing order, and if entry i is in row 1, 
then taking step i of the corresponding path to the east, otherwise taking it to 
the north. 

5. We claim that P(p) must be of rectangular shape (10x8). Indeed, P(n) has ten 
rows, and its rows are of length eight or less, otherwise the first row would 
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be of length nine, implying that p contains an increasing subsequence of 
length nine. Therefore, the fifth row of P(n) is also of length eight. 

7. By the argument seen in the solution of Exercise 1, it suffices to find a 
formula for the number of all SYT on n boxes that have at most two columns. 
In such SYT, the entry n must be at the end of either column. If n is even, 
and our SYT is rectangular, then n must be at the end of the second column. 
This implies the recursive formulae 



2/„_i (123) if n is even, 



where in the last step we use the result of Exercise 3. The result is now 



9. Theorem 7.5, with its notations, shows that necessarily a x =k and 
aprapi — Ya=r-k. Thus necessarily fl 1 =a 2 ="-=a r =Aand a^<ik, otherwise there 
would be r +1 increasing subsequences of length k which are disjoint. This 
means that the size of the last column is m/,=r. Applying (7.1) with k variables 
instead of k-\ and fixing m^rwe a the proof exacdy as that of Theorem 7.4. 

11. In any SYT of shape F, the entry n has to be in one of the inner corners. So 
removing n from any such tableaux, we get a SYT of shape F’ for some F’ 
that is part of the summation. 

13. Let p n = qrf- h is clear that for any a and b, we have q’(a, b)>q(a, b) -1, as q=j 
is the only entry of q that moved back. Thus, the only values of (a, b) that 
could possibly cause problems are those for which^(a, b)=q(a, b) holds. Note 
that p,rf, so p(a, b)=q(a, b) cannot hold if b>jand a<n- 1. So if b>j, and a<n- 
1, then we have 


P(a, b)^q(a, b)-<q’(a, b). 


If b<q, then the reduction (q h qJ has no effect on q(a, b) as it is two elements 
larger than b that move. Therefore, we have 


p(a, b)<q(a, b)<q’(a, b). 


Finally, if b=n, then p(a, b)=q’(a, b)=n+l-a trivially, for any a. So we have 
seen thatp(a, b)^q’(a, b) always holds. 

15. Yes, there is. Let «i=13 12 10 14 8 11 6 9 4 7 3 2 1 5. Then let a i+1 be 
obtained from i by simply inserting two consecutive elements right after the 
maximum element m of «,> and giving them the values (m- 4) and (m- 1), and of 
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course, relabeling the other elements naturally. It is not hard to see that the 
a, consist of two decreasing subsequences, and that they form an antichain. 
This construction is due to Daniel Spielman and the author, [29]. 

17. An isomorphism between the two posets can be constructed using the idea 
given in the proof of Exercise 3 of Chapter 3. 

19. Let M be the largest entry in the set P u {u} U {c}- The statement can be 

proved by induction on the size of P, the initial case being obviously true. See 
[167], pages 95-97 for the details of the inductive step. 

21. This result is based on an observation from [74]. It is clear that the right- 
hand side is the number of 321-avoiding n-permutations. We claim that the 
left-hand side is the same, counted by the length k of the longest increasing 
subsequence. Indeed, if ;ris such a permutation, then risk(n) has at most two 
rows, and exactly k columns. There is exactly one Ferrers shape ^satisfying 
these criteria, and then the hooklength formula shows that 



and the proof follows. 

23. Consider the fraction on the right-hand side of (7.1). Its numerator is clearly 
n\, as it should be. In the denominator, the term (m lA )\ is the product of the 
hooklengths of all boxes in the last row of F. How about the term (m h . 2 + 1)! 
in the denominator? It is almost t he product of the hooklengths of all boxes in 
the next-to-last row of F. We must say “almost” because if m iA <sm k . 2 , then 
there will be one term missing from the m k . 2 + 1 terms (hooklengths) whose 
product is (m k . 2 + 1)!. This is because the rightmost box of row k-2 that has a 
southern neighbor will have a hooklength that is larger by 2 than the 
hooklength of its right neighbor. Therefore, the missing hooklength will be 
m k-2~m t . i+l. However, this is corrected by the appropriate term of the product 
in the brackets, that is, the term given by i=h-2 and j=k- 1. 

Continuing this way, we see that the product of the hooklengths in row i is 


(rrii - rrij + j - i) ■ (rm + k-i - 1)!, 




and the proof follows by taking products for i e[£-l]. 

25. Yes, P' n is a lattice. We prove this by induction on n, the initial case of n=l 
being trivial. Assume the statement is true for n- 1, and prove it for n. Let x= 
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x^—xn and y=y\ jjvjVnbe two elements of PL Let x’ (resp. y) be the (w-1)- 
permutation obtained from x (resp. y ) by removing the maximal entry n. Let 
v = x * 1 A y' and let z = x 1 V y'. Let X=n, and y = n, and assume without 
loss of generality that i<j. It is then easy to verify that inserting n into the jth 
position of v results in x A y, and inserting n into the ith position of z results 
in x V y. 

27. No, if ri>2, then is not complemented. For instance, let x=2134--w. Then 
y=n(n-l )—4 13 2 andjy =n(wT)--4312 both satisfy the requirements. 

29. No, /„ is not self-dual in general. For instance, if re= 4, then there are three 
elements in /„ that cover the minimum element, namely 1243, 1324, and 
2134. At the same time, there are only two elements that are covered by the 
maximum, namely 4231 and 3412. 

31. It is proved in [202] that the number of these posets is 

(1 + o(l))n! 2 3 

2y/e ' 

33. We have seen that i is a descent in Q(p) if and only if i is a descent of p. 
However, now p is an involution, so P(p)=Q(p). Therefore, the question is 
reduced to asking what the probability is that i is a descent of a randomly 
selected SYT on n boxes. As we have seen in Exercise 24 of Chapter 6, this 
probability is 1/2. 


Solutions for Chapter 8 

1. This is a classic algorithm for generating permutations, which was found 

independently by Johnson [128] and Trotter [190]. The proof is not difficult 
by induction on n, the initial case of n—2 being obvious. Now assume the 
statement is true for n- 1. Then one proves horn the definition that the 
algorithm will list the n\ permutations of length n so that the first n, the next 
n, the following n, and so on, will only differ in the position of n, while the 
subsequence of the entries horn [n- 1] will be unchanged within each of these 
w-tuples of permutations. Among the ^-tuples, these subwords will be changed 
according to the list of (w-1)! permutations of length n- 1 , generated by this 
same algorithm. 

3. Such an algorithm can be found in [83]. 
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FIGURE 9.18 

Passing 231 through two stacks. 


5. We have seen that for any n-permutation p, the image s(p) ends in the entry 
n. Iterating this argument (to the shorter string preceding n in s(p)), the image 
s(s(p)) ends in the string (w-l)w, the image E(p) ends in (n-2) (n-l)n, and so on, 
the image A 1 (p) ends in 23- -n, so must be the identity permutation. 

7. We claim that for n=. 3, these are the (re-2)! permutations of the form Snl, 
where S is any permutation of the set {2, 3, n- 1}. We prove this statement 
by induction on n. The initial case of re=3 is obvious. Assume we know the 
statement for n, and let p be an (w+1)- permutation that is not (w-l)-stack 
sortable. Eet p=L(n+\)R. Then s(p)=s(L)s(R){n+ 1) is not (w-2)-stack sortable. 
By our induction hypothesis, this means that s(L)s(R)=23--n 1 . As s(R) has to 
end in its largest entry, we must have s(R)=R=\, showing that p is indeed of 
the form L(n+ 1)1 as claimed. 

9. We have seen in Proposition 8.17 that a /-stack sortable permutation must 
always avoid the pattern 23---(/+2) 1. On the other hand, observe that 
Theorem 4.12 implies that 5„(234-"(/+2)l)=5'„(123-"(/+ 2))<(M-1) 2 ", where 
the last inequality was proved in Theorem 4.10. 

11. (a) The image of 231 is 123 as shown in Figure 9.18. 

(b) These are precisely the /-stack sortable permutations. Even more 
strongly, s,(p)=d(p) for all p. Indeed, it is easy to see by induction 
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on i that the entries of p will leave stack i in the order identical to 
s‘(p). 

13. An n-permutation is (w-2)-stack sortable except when it ends in the string nl. 
So W„. 2 (n, k) is just the number of (w-2)-permutations with k-\ descents, that 
is, W„. 2 (n, k)=A(n- 2, ft). As the Eulerian numbers are log-concave, our statement 
is proved. 

15. Let p=LnR, when L and R are allowed to be empty. If neither L nor R are 
empty, then let f(p)=f(L)nf(R). If L is empty, that is, p=nR, then let f(p)=f(R)n. 
Finally, if R is empty, that is, p=Ln, then let f(p)=nf(L). Use this same rule 
recursively to compute f(L) and f(R). 

17. Yes. Take the antichain A from the solution of Exercise 15 of Chapter 7, then 
take the reverse of all the permutations in it, to get the antichain A’. Then A’ 
consists of 321-avoiding permutations, that is, permutations containing an 
increasing subsequence of length at least nJ2. Now affix the entry 1 to the 
end of each entry of A’ to get the new antichain A”. 

19. The condition that p r =q, means that going through the stack has the same 
effect on p and on q. In other words, the movement sequences associated to the 
two permutations are the same. By this we mean the following. To each 
permutation of length n, we can associate a sequence of parentheses o/length 2 n, 
consisting of n copies of ")” (left parenthesis) and n copies of ")”, (right 
parenthesis) describing how the permutation passes through the stack. Each 
time an entry goes DOWN in the stack, we write a left parenthesis, and each 
time an entry comes UP, we write a right parenthesis. 

For example, sequence of parentheses of x=123 is 000, the sequence of 
parentheses of ;M321 is ((0)), while the sequence of parentheses of both 
p= 132 and <p=231 is Q(Q). 

So p and q satisfy the conditions of this exercise if and only if they have the 
same sequence of parentheses. We claim that this, in turn, is equivalent to 
the condition that T(p) and T(q) are the same as unlabeled trees. This is 
straightforward by induction on n, if we note that the size of the right subtree 
of T(p) and T(q) is k, where there are 2k parentheses (k left, k right) inside the 
pair of parentheses that ends last. Indeed, this is just the number of entries 
that were on the right of n, therefore entered the stack after n, but exited the 
stack after n. 

21. We have seen in Exercise 19 that b p just describes what effect the stack has 
on p. We have also seen that this effect only depends on the sequence of 
parentheses associated to p, or, in other words, the unlabeled tree obtained 
from T(p) by omitting its labels. As there are C„ such trees, we get that B has 
C n = Cn)/( n + 1) elements. 
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23. We have seen in Exercise 21 that going through a stack can effect an n- 
permutation in at most C„ ways, so an w-permutation can have at most C„ 
preimages. On the other hand, the identity permutation does have C„ 
preimages, namely all the 231-avoiding permutations. We claim this is the 
only permutation with that property. 

To see that no other n-permutation can have C„ preimages, we apply induction 
on n. For m< 3, the statement is clearly true. Now assume the statement is 
true for all positive integers less than n. If s(q)=p, and q=LnR, then we have 
p= s(L)s(R)n. Now keep n fixed in q, in position k, and change L and R so that 
s(q) does not change. Note that this means that the set of entries in L and the 
set of entries of R cannot change, as otherwise the set of the first k-\ entries 
in s(q) would change. Similarly, s(L), and s(R) have to remain unchanged. 

By our induction hypothesis, there are at most C H ways we can permute the 
entries of L without changing s(L), and there are at most C n . k ways we can do 
this with R, with equality holding only if both s(L) and s(R) are monotone 
increasing, that is, only if p is increasing. As k can range from 1 to n, this 
means that the number of preimages of p is at most 


n 





with equality holding only if/>=123-"W. 

25. (a) These are the permutations that avoid both 2431 and 4231. 

(b) It follows from part (a) and the characterization of ir-sortable permutations 
that p is ir-sortable if and only if (pf/yis or-sortable, proving our claim. 

27. An M-permutation p is separable if and only if it avoids both 2413 and 3142. 
First we show that this is necessary. Indeed, the statement is obvious for 
re= 4, and follows by induction on n for larger n. 

Now let p avoid both 2413 and 3142. Then clearly, the reverse of p also 
avoids these patterns, so we can assume without loss of generality that 1 
precedes n in p. Then all entries on the right of n have to be larger than all 
entries on the left of 1, or a 3142 pattern would be formed. If there is no 
entry on the left of n that is larger than an entry on the right of n, then p is 
separable, and we are done. Otherwise, let dbe the leftmost entry on the left 
of n that is larger than the smallest entry c on the right of n. Then dnc is a 
231-pattern, meaning that there cannot be any entry ^Kc located between d 
and n, for dime would be a 3142 pattern. So all entries located between d and 
n are larger than c, whereas all entries on the left on d are smaller than c, 
proving that p is separable. Indeed, p=LR, where the split occurs immediately 
before d. Then both L and R avoid 2413 and 3142, and therefore, are 
separable. 
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The concept of separable permutations was introduced in [42]. 

29. These are the permutations avoiding the pattern q={t+l)t---\. It is clear by 
the Pigeon-hole Principle that these permutations cannot be /-queue sortable. 
On the other hand, if p avoids q, then let us define the co-rank of an entry 
to be the length of the longest decreasing subsequence ending at that entry. 
Then p is /-queue sortable by sending all entries of co-rank i to queue i. 
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List of Frequently Used Notations 


A(n, k) 

number of w-permutations with k-1 descents 

c(n, k) 

the number of w-permutations with k cycles 

d(p) 

number of descents of the permutation p 

i(p) 

number of inversions of the permutadon p 

w 

number of involutions of length n avoiding the pattern q 

n! 

n(n- 1)--1 

[S3 

n! 

kl(n—k)l 

[n] 

{1, 2, n } 

[n] 

(f l +q n2 +-+ 1 

[n]! 

[n][n-l]-[l] 

[S3 

[n]t 

[k]![n-k]! 

(n) m 

n(n-l)---(n^m+l) 

p(n) 

number of partitions of the integer n 

S n 

set of all n-permutations 

$n(q) 

number of w-permutations avoiding the pattern q 

Sn,r(q) 

number of w-permutations containing exactly r copies of q 

S(n, k) 

number of partitions of the set [n] into k blocks 

s(n, k) 

(-1 )"' k c(n, k) 

4) 

image of the permutation p under the stack sorting operation 
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